(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 




(19) World Intellectual Property Oi^anization 

International Bureau 

(43) International Publication Date (10) International Publication Number 

8 February 2001 (08.02.2001) pCT WO 01/09300 A2 



(51) International Patent Classification^: C12N 15/00 



(21) International Application Number: PCT/NLOO/00534 



(22) International Filing Date: 

(25) Filing Language: 

(26) Publication Language: 



27 July 2000 (27.07.2000) 



English 



English 



(30) Priority Data: 

99202540.3 



2 August 1999 (02.08. 1999) EP 



(71) Applicant (for all designated States except US): KEY- 
GENE N.V. [NL/NL]; P.O. Box 216, NL-6700 AE Wa- 

geningen (NL). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): FIERENS-ON- 
STENK, Bernarda, Gerharda, Johanna [NL/NL]; 
Vrekenhorst 91, NL-3905 VS Veenendaal (NL). DE 
BOTH, Michiel, Theodoor, Jan [NL/NL]; Gruttoweide 
199, NL-6708 EG Wageningen (NL). 



(74) Agent: JORRITSMA, Ruurd; Nederlandsch Octrooibu- 
reau, Scheveningseweg 82, P.O. Box 29720, NL-2502 LS 

The Hague (NL). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CR, CU, CZ, 
DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, HR, 
HU, ID, IL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR, 
LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, MZ, 
NO, NZ, PL, PT, RO. RU, SD, SE, SG, SI, SK, SL, TJ, TM, 
TR, TT, TZ, UA, UG, US, UZ, VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE), OAPI patent (BF, BJ, CF, CG, 
CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 

Published: 

— Without international search report and to be republished 

upon receipt of that report. 

For two-letter codes and other abbreviations, refer to the "Guid- 
ance Notes on Codes and Abbreviations " appearing at the begin- 
ning of each regular issue of the PCT Gazette. 



(54) Title: METHOD FOR GENERATING RESISTANCE AGAINST CGMMV IN PLANTS, GENETIC CONSTRUCTS FOR 
USE IN SAID METHOD, AND CGMMV-RESISTANT PLANTS OBTAINED VIA SAID METHOD 

^5 (57) Abstract: The present invention relates to a method for generating resistance against Cucumber Green Mottle Mosaic Vims 
2^ (CGMMV) in plants, in particular in plants susceptible to infection by CGMMV, such as Cucurbitaceae species, including melon, cu- 
^ cumber, watermelon and bottlegourd. In the invention, the plants are provided with replicase-mediated resistance against CGMMV. 
For this purpose, the plants are transformed with a polynucleotide sequence that: i) has been derived from the 129 kD sequence, 
the 57 kD sequence, or the 186 kD readthrough sequence of native CGMMV; ii) upon (at least) transformation into the plant and 
transcription into RNA - and usually also translation into the corresponding encoded protein - can provide the plant with resistance 
against CGMMV; but iii) does not encode any replicase activity. In particular, this will be a nucleotide sequence that encodes a 
defective variant of the replicase of CGMMV. The invention ftuther relates to genetic constructs containing such a pol3mucleotide 
sequence suitable for use in the above method, and to CGMMV-resistant transgenic plants obtained via said method. 
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Method for generating resistance against CGMMV in plants, genetic constructs for use in said 
method, and CGMMV -resistant plants obtained via said method. 

The present invention relates to a method for generating resistance against Cucumber 
5 Green Mottle Mosaic Virus (CGMMV) in plants, in particular in plants that are susceptible to 
infection by CGMMV, such as species of the Cucurbitaceae family. 

The invention further relates to genetic constructs suitable for use in said method, and 
to CGMMV-resistant transgenic plants obtained via said method. 

Methods of introducing DNA sequences into the genome of plants have been known 
1 0 for many years and have been widely used to alter the properties of plants varieties. Such 
methods are among others Agrobacterium-medvaXtdi transformation (Horsch etal, 1985; 
Rogers et al, 1986), protoplast transformation using electroporation or other techniques to 
introduce naked DNA molecules into the plant cell (Shillito et al. 1985), and particle 
bombardment to introduce naked DNA molecules into plant cells or tissues (Christou et al, 
15 1994). 

Among the most important applications of plant genetic engineering are those aimed at 
introducing resistance genes to a wide variety of plant pests and plant pathogens, such as 
bacteria, fungi, nematodes, insects and viruses. Many examples of virus resistance in a wide 
variety of plant species have been described over the last decades (Wilson et al, 1993). The 

20 various methods to obtain virus resistance in plants through the introduction of gene 

sequences are either based on the use of genes of plant origin; on the use of sequences/genes 
derived from the viral pathogen itself (so-called pathogen-derived resistance (Wilson et al, 
1993 ), or on the use of genes of yet different origin. Sequences originating from the viral 
genome can be either cloned or PCR-amplified DNA sequences obtained from the genome of 

25 DNA viruses, such as geminiviruses (Kunik et al , 1 994) or the cDNA sequences obtained 
from the genomes of RNA viruses through the use of cDNA cloning or RT-PCR 
amplification. 

Examples of sequences/genes of RNA viruses that have been succesfully used in the 
engineering of virus resistance in plants include: 

30 1 . coat protein genes of tobamo viruses, cucumoviruses, potyviruses, potexviruses 

(Beachy etal, 1990); 
2. RNA-dependent RNA polymerase genes (replicase genes) of tobamoviruses, 
cucumoviruses, potyviruses (Anderson et al, 1992; Donson et al, 1993; Audy et 
al, 1994); 
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3. nucleoprotein genes of tospoviruses (Goldbach and De Haan, 1993; Prins et al, 
1994; Vairaeffl/., 1995); 

4. movement protein genes of tobamoviruses and cucumoviruses (Cooper et al. , 
1995). 

5 Cucumber Green Mottle Mosaic Virus (CGMMV) is a member of the tobamovirus 

group and infects plant species of the Cucurbitaceae family: melon (Cucumis melo), 
cucumber {C.sativus), watermelon {Citrullus vulgaris) and bottlegourd (Lagenaria siceraria), 
but not apparently Cucurbita pepo (squash, pumpkin, courgette). The host range of the virus 
is basically restricted to members of the Cucurbitaceae, with the exception that some but not 

1 0 all strains also infect tobacco and/or the diagnostic species Datura stramonium and 
Chenopodium amaranticolor (Rollings et al. , 1 975). 

Several different strains can be distinguished serologically and by their response in 
C. amaranticolor and D. stramonium (Rollings et al., 1975) The ''type strain" was originally 
identified in Europe and does not normally cause fi-uit symptoms in cucumber. Another 

15 European strain, called the cucumber aucuba mosaic strain, cucumber virus 4 or Cucumis 
virus 2A causes fruit symptoms in cucumber. A number of strains are knov,Ti from Japan. In 
watermelon, the watermelon strain causes serious disease, whereas the Japanese cucumber 
strain (also called Kyuri Green Mottle Mosaic Virus) and the Yodo strain cause fruit 
distortions in cucumber. The CGMMV-C strain from India is a pathogen on bottlegourd and 

20 serious infections can cause complete crop losses. 

In cucumber, CGMMV causes vein clearing, a light and dark green leaf mottle, leaf 
blistering and malformation and stimted growth, seriously affecting fruit yield. The East 
European isolates of the aucuba mosaic strain produces bright yellow leaf mottling and fruit 
discolouration. 

25 CGMMV is transmitted through seed, but mostly through mechanical infection via the 

roots in contaminated soil, and through foliage contact and handling of plants (HoUings et al. , 
1975). The virus particles are extremely stable and survive several months at normal 
temperatures. This stability combined with the very high infectivity through mechanical 
contact of the foliage is responsible for the economic importance of this virus, as even one or 

30 a few infected plants in a cucumber greenhouse can eventually cause the infection and loss of 
the total crop. Also, infection may not only spread rapidly over a current crop, but also - due 
to the strong persistance of the virus - affect subsequent crops. Therefore, a CGMMV 
infection may require sterilisation of an entire greenhouse, as well as the use of sterile tools 
and materials. 
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The complete sequence of only one isolate of CGMMV has been determined (Ugaki et 
al, 1991 ; Genbank accession numbers D12505 and D01188). This isolate "SH" had been 
found in infected watermelon plants in East Asia. Furthermore, the sequence of the coat 
protein gene of one other isolate ("W") obtained from infected watermelon is known (Meshi 
5 et al, 1983; Genbank accession numbers V01551 and J02054), as well as the sequence of the 
29 kD movement protein gene of a watermelon strain (Saito et al, 1988; Genbank accession 
number J04332). The nucleotide sequence of the CGMMV-SH isolate shows 55 to 56% 
identity with tobacco mosaic virus (TMV) and tobacco mild green mosaic virus (TMGMV), 
both other members of the tobamovirus group (Ugaki et al, 1991). 

1 0 As described by Ukagi et al , the genome of CGMMV consists of a single-stranded 

RNA molecule coding for at least four open reading frames, encoding putative proteins of 1 86 
kD, 129 kD, 29 kD and 17.3 kD, of which the 17.3 kD ORF is known to encode the coat 
protein. In this respect, Ugaki et al state: 'Wo CGMMV-encoded proteins except for the coat 
protein have yet been identified in vivo". 

15 The CGMMV genome is schematically shown in Figure 1 . As can be seen therein, the 

ORF encoding the 186 kD protein starts at the same site as the ORF encoding the 129 kD 
protein, and adds a putative 57 kD polypeptide to the 129 kD ORF. The presence of this 57 
kD protein alone has not been detected in infected plants. Instead, the 1 86 kD protein has 
been found, being the product of a read-through translation of the 129 kD and the 57 kD 

20 ORFs. 

This 186 kD protein is thought to play a role in virus replication. Also, the 129 kD 
ORF is thought to encode a replicase function, whereas the 29 kD ORF is thought to encode a 

movement protein. 

Hereinbelow, the nucleotide sequence corresponding to the ORF encoding the 129 kD 
25 protein will be referred to as "729 kD sequence", the sequence corresponding to the 186 kD 
readthrough protein will be referred to as ''186 kD sequence", and the nucleotide sequence 
corresponding to the ORF encoding the 57 kD readthrough part will be referred to as "57 kD 
sequence". These nucleotide sequences and the corresponding protein sequences are given in 
the sequence listings, as further described below. 
30 Object of the invention was to provide a method for protecting plants, in particular 

plants susceptible to infection with CGMMV such as species of the Cucurbitaceae family, 
against infection with CGMMV, and in particular against infection with strains of CGMMV 
prevalent in Europe, such as the strains encountered in the cultivation of cucumbers in 
greenhouses. 
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Further objects were to provide means for use in said method, in particular a genetic 
construct that can be used for transforming plants or plant material so as to provide transgenic 
plants resistant against infection with CGMMV. Further objects of the invention will become 
clear from the description given hereinbelow. 
5 For these purposes, applicant has investigated the symptomatology and the nucleotide 

sequence of the coat protein genes of 10 European strains of CGMMV, and compared these 
with the SH strain described by Ugaki et al. A list of these strains, with their geographical 
origin and symptoms on cucumber, is given in Table 1 . 

10 Table 1 . List of collected CGMMV-isolates with their geographical origin and symptoms on 
cucumber. 



CGMMV isolate 


geographical 
origin 


symptoms on cucumber 


1 


Eastern Europe 


vein clearing, mosaic 


2 


Eastern Europe 


verin clearing, mosaic 


3 


IPO-DLO, the 
Netherlands 


almost without symptoms 


4 


the Netherlands 


weak leaf chlorosis 


5 


the Netherlands 


weak leaf chlorosis 


6 


Proefstation 
Naaldwijk, the 
Netherlands 


chlorosis 


7 


Rijk Zwaan, the 
Netherlands 


chlorosis 


8 


Israel 


chlorosis 


9 


Almeria, Spain 


chlorotic leaf spots 


10 


Almeria, Spain 


weak leaf chlorosis 


CGMMV-SH 


Japan 


strong chlorotic leaf mosaic 



15 

It was found that the sequences for the 10 European isolates are highly homologous 
( i.e. homology on the nucleotide level of 97%), and show about 90 % homology (on the 
nucleotide level) with the SH-isolate. The nucleotide sequences encoding the coat proteins of 
each of the isolates 1-10, as well as strain SH, are given in the sequence listings, as further 
20 described below. The corresponding phytogenetic tree is shown in Figure 2. This shows that 
the European isolates can be considered to constitute a subgroup of the CGMMV species. 

In the sequence listings: 
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SEQ ID no.l gives the nucleotide sequence encoding the 129 kD replicase protein of 
CGMMV isolate 4, with the ORF of the coat protein starting with the ATG codon at bp 
523-525; 

- SEQ ID no.2 gives the amino acid sequence of the 129 kD replicase protein of CGMMV 
5 isolate 4, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 

- SEQ ID no.3 gives the nucleotide sequence encoding the 57 kD protein of CGMMV 
isolate 4, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 
SEQ ID no.4 gives the amino acid sequence of the 57 kD replicase protein of CGMMV 
isolate 4, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 

10 - SEQ ID no. 5 gives the nucleotide sequence encoding the 186 kD readthrough protein of 
CGMMV isolate 4, with the ORF of the coat protein starting with the ATG codon at bp 
523-525; 

- SEQ ID no. 6 gives the amino acid sequence of the 186 kD readthrough protein of 
CGMMV isolate 4, with the ORF of the coat protein starting with the ATG codon at bp 

15 523-525; 

SEQ ID no. 7 gives the nucleotide sequence encoding the coat protein of CGMMV isolate 

1, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 

- SEQ ID no. 8 gives the nucleotide sequence encoding the coat protein of CGMMV isolate 

2, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 

20 - SEQ ID no.9 gives the nucleotide sequence encoding the coat protein of CGMMV isolate 

3, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 

- SEQ ID no. 1 0 gives the nucleotide sequence encoding the coat protein of CGMMV 
isolate 4, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 
SEQ ID no. 1 1 gives the nucleotide sequence encoding the coat protein of CGMMV 

25 isolate 5, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 

- SEQ ID no. 1 2 gives the nucleotide sequence encoding the coat protein of CGMMV 
isolate 6, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 

- SEQ ID no. 1 3 gives the nucleotide sequence encoding the coat protein of CGMMV 
isolate 7, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 

30 - SEQ ID no. 14 gives the nucleotide sequence encoding the coat protein of CGMMV 

isolate 8, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 
SEQ ID no. 1 5 gives the nucleotide sequence encoding the coat protein of CGMMV 
isolate 9, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 
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- SEQ ID no. 16 gives the nucleotide sequence encoding the coat protein of CGMMV 
isolate 10, with the ORF of the coat protein starting with the ATG codon at bp 523-525; 

- SEQ ID no. 17 gives the nucleotide sequence encoding the 129 kD replicase protein of 
CGMMV isolate SH; 

5 - SEQ ID no. 1 8 gives the amino acid sequence of the 129 kD replicase protein of CGMMV 
isolate SH; 

- SEQ ID no. 1 9 gives the nucleotide sequence encoding the 57 kD protein of CGMMV 

isolate SH; 

- SEQ ID no. 20 gives the amino acid sequence of the 57 kD replicase protein of CGMMV 
10 isolate SH; 

- SEQ ID no.21 gives the nucleotide sequence encoding the 1 86 kD readthrough protein of 
CGMMV isolate SH; 

- SEQ ID no.22 gives the amino acid sequence of the 1 86 kD readthrough protein of 
CGMMV isolate SH; 

15 - SEQ ID no.23 gives the nucleotide sequence encoding the coat protein of CGMMV 
isolate SH; 

- SEQ id's nos. 24-40 give the nucleotide sequences of the primers used in the Examples; 

- SEQ ID'S nos. 41-44 give the nucleotide sequences used in assembling the leader 
sequences used in the constructs described in the Examples; 

20 In the above sequence listings, the nucleotide sequences given are DNA sequences, as 

the genetic constructs of the invention described below will usually contain or consist of a 
DNA. As CGMMV is an RNA virus, it will be clear to the skilled person that these sequences 
will occur in the virus as the corresponding RNA sequence (i.e. with U replacing T). Also, it 
will be clear to the skilled person that the nucleotide sequences given above may be followed 

25 - both in the virus as well as in a construct of the invention - with a suitable termination 
codon, i.e. TAA/UAA, TAG/UAG or TGA/UGA (not shown). 

Furthermore, as will be clear to the skilled person, the nucleotide sequence encoding 
the coat protein will usually start with an ATG codon. For example, in SEQ ID NOs 1-16, 
the nucleotide sequence encoding the coat protein starts at the ATG codon at base positions 

30 523-525. (In the nucleotide sequence of SEQ ID NOs 1-16, the nucleotide sequence 
encoding the coat protein is preceded by another nucleotide sequence, e.g. encoding a 
movement protein. Accordingly, when hereinbelow reference is made to any nucleotide 
sequence of SEQ ID NOs 1-16, this also explicitly includes the nucleotide sequence starting 
at the ATG codon at base positions 523-525 of these SEQ ID's). 
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A particular purpose of the invention is therefore to provide a method that can provide 
plants with resistance against all these strains simultaneously, and more in particular a type of 
resistance that is agronomically useful, i.e. that can be used to protect (crops of) plants that 
are cultivated under circumstances wherein the high infectivity and persistance of CGMMV 
5 can be a major problem, such as the cultivation of cucumbers in greenhouses. 

In the invention, this problem is generally solved by transforming a plant with a 
polynucleotide sequence (e.g. as part of a genetic construct) that can provide the plant with 
so-called "replicase-mediated" resistance against CGMMV. In particular, this wil be a 
polynucleotide sequence that 
1 0 i) has been derived from the 1 29 kD sequence, the 57 kD sequence, or the 1 86 kD 

readthrough sequence of native CGMMV; 
ii) upon (at least) transformation into the plant and transcription into RNA - and usually 

also translation into the corresponding encoded protein - can provide the plant with 

resistance against CGMMV; but 
1 5 iii) does not encode any replicase activity. 

In particular, such a polynucleotide sequence can encode a polypeptide or protein that 
is capable of providing a plant with resistance against GCMMV, but that by itself has no 
replicase activity, for instance due to one or more alterations in its amino acid sequence, 
compared to the amino acid sequence encoded by the 129 kD sequence, 57 kD sequence, 
20 and/or 186 kD readthrough sequence of native CGMMV. 

However, according to one specific embodiment of the invention, the polynucleotide 
sequence may also comprise, or even consist of, the native 57 kD sequence. 

In a first aspect, the invention therefore relates to a method for generating resistance in 
a plant against CGMMV, said method comprising at least the step of transforming said plant 
25 with a polynucleotide encoding a defective variant of the replicase gene of CGMMV. 

In a further aspect, the invention also relates to a method for providing a transgenic 
plant and/or plant cell that is resistant against infection with CGMMV, comprising at least the 
step of transforming said plant or plant cell with a polynucleotide sequence encoding a 
defective variant of the replicase gene of CGMMV. 
30 In another aspect, the invention also relates to a genetic construct suitable for 

transforming a plant, said construct at least comprising a polynucleotide sequence encoding a 
defective variant of the replicase gene of CGMMV, and optionally further elements of genetic 
constructs known per se. The invention also relates to a plant, plant cell and/or plant material 
that has been transformed with a genetic construct of the invention. 
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The invention also relates to transgenic plants that contain a polynucleotide sequence 
encoding a defective variant of the replicase gene of CGMMV, and/or that have been 
provided with resistance against infection with CGMMV by the method of the invention. 

In the context of the invention, by the "replicase gene of CGMMV" is meant the native 
5 129 kD sequence, the native 57 kD sequence, and/or the combined native 186 kD 
"readthrough" product of the native 129 kD and native 57 kD sequences. 

By a '^native" sequence is meant any RNA sequence that naturally occurs in 
CGMMV, including all isolates and strains thereof, as well as any DNA sequence that 
corresponds to these naturally occuring RNA sequences. Examples of such native sequences 
1 0 are the 1 29 kD nucleotide sequences given in SEQ. ID no. 1 and SEQ. ID no. 1 7, the 57 kD 
nucleotide sequences given in SEQ. ID no. 3 and SEQ. ID no. 19 and the 186 kD nucleotide 
sequences given in SEQ. ID no.5 and SEQ. ID no.21 . It will be clear to the skilled person that 
there may be (further) naturally occuring variants of the RNA sequence from which the DNA 
sequences in the sequence listings were derived, and these (and the DNA sequences 
15 corresponding thereto) are also included within the term ''native sequence". 

By "a polynucleotide sequence encoding a defective variant of the replicase gene of 
CGMMV' in its broadest sense is meant a polynucleotide sequence that 
i) upon (at least) transformation into a plant and transcription into RNA generates 

resistance against infection with CGMMV in said plant; and 
20 ii) upon (at least) transformation into a plant and transcription into RNA does not lead to 

generation of (any) replicase activity in said plant (or at least -when it does lead to 

expression of some replicase activity - leads to expression of a replicase activity that is 

severely reduced compared to expression of the native gene encoding CGMMV 

replicase). 

25 Herein, the terms "plant", "transformed plant" and/or ""transgenic plant" include all 

parts or tissues of such a plant, including but not limited to individual cells of such a plant. 
These terms also includes material of or for such a plant, such as material that can be 
regenerated into a (mature) plant, including but not limited to protoplasts and/or callus tissue, 
or material that can be cuhivated into a mature plant, such as cultivation material. 

30 The plant is preferably a plant that is susceptible to infection with CGMMV, more 

preferably a plant belonging to the Cucurbitaceae family, such as melon (Cucumis melo), 
cucumber (Csativus), watermelon {Citrullus vulgaris) and bottlegourd (Lagenaria siceraria). 

Included within the term "CGMMV" are all known strains thereof, including those 
prevalent in Europe and Asia. In particular, the method of the invention can be used to protect 
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plants against strains of GCMMV prevalent in Europe (including Israel), such as those which 
are a problem in the cultivation of melons and in particular cucumbers in greenhouses, 
although the invention is not limited thereto. 

In doing so, a major advantage of the invention is that it can provide protection against 
5 several, and preferably all, (such) strains of CGMMV simultaneously. Another advantage of 
the invention is that it provides "absolute" protection against CGMMV, v^rhich means that - 
upon expression of a polynucleotide sequence encoding a defective replicase in a plant - 
essentially no viral particles can be detected in the transformed plant (material). The method 
of the invention therefore does not lead to a deferral or slowing down of the onset of 
1 0 symptoms, as may occur when so-called "coat protein-mediated" resistance is used. Also, the 
method of the invention leads to a high level of resistance, and may also have the advantage 
of a favorable temperatiu-e effect. 

Usually, the "nucleotide sequence encoding a defective variant of the replicase gene of 
CGMMV " will be a nucleotide sequence in which - compared to a nucleotide sequence 
1 5 encoding the corresponding native replicase of CGMMV - one or more nucleotides have been 
added, replaced and/or removed. In particular, the "nucleotide sequence encoding a defective 
variant of the replicase gene of CGMMV mdcy be a nucleotide sequence that comprises, and 
preferably consists of: 

- a nucleotide sequence corresponding to the native 129 kD sequence in which - 

20 compared to said native sequence - one or more nucleotides have been added, replaced 

and/or removed; 

- a nucleotide sequence corresponding to the native 1 86 kD sequence in which - 
compared to said native sequence - one or more nucleotides have been added, 
replaced and/or removed, e.g. in the part of the native 186 kD sequence corresponding 

25 to the 129 kD sequence, to the 57 kD sequence, or both; 

- a nucleotide sequence corresponding to the native 57 kD sequence; 

- a nucleotide sequence corresponding to the native 57 kD sequence in which - 
compared to said native nucleotide - one or more nucleotides have been added, 
replaced and/or removed; 

30 such that said nucleotide sequence is capable - upon (at least) transformation into a plant and 
transcription into RNA - to confer to said plant resistance against infection with CGMMV, 
and such that said nucleotide sequence - upon (at least) transformation into a plant and 
transcription into RNA - is not capable of generating of (any) replicase activity in said plant. 
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Usually, the "nucleotide sequence encoding the defective variant of the replicase gene 
of CGMMV' will encode a protein or polypeptide, more specifically a protein or polypeptide 
that: 

1 ) upon being expressed in a plant is capable of generating resistance against CGMMV in 
5 said plant; and 

2) upon being expressed in a plant has no replicase activity (or -when it has some replicase 
activity- has severly reduced replicase activity compared to the native CGMMV 

replicase). 

Such a protein or polypeptide will be generally referred to hereinbelow as "defective 
10 replicase"; and a polynucleotide sequence encoding such a protein or polypeptide will be 
referred to as a "polynucleotide sequence encoding a defective replicase". 

Usually, the defective replicase will be a derivative - such as an analog, homolog, 
variant, mutant, part, fragment or combination of two or more such parts or fragments, etc.- of 
the amino acid sequence encoded by the native 129 kD sequence, the native 186 kD sequence 
15 and/or the native 57 kD sequence, in which -compared to the amino acid sequence encoded by 
the corresponding native sequence- one or more amino acids have been added, replaced or 
removed, preferably replaced or removed, more preferably removed, leading to loss of 
replicase activity (or at least an inability to generate replicase activity when expressed in the 
plant). 

20 In particular, the defective replicase may be a protein or polypeptide that comprises, 

and preferably consists of: 

- an amino acid sequence corresponding to the amino acid sequence encoded by the 
native 129 kD sequence, in which - compared to said native sequence - one or more 
amino acids have been added, replaced or removed, preferably replaced or removed, 

25 more preferably removed; 

- an amino acid sequence corresponding to the amino acid sequence encoded by the 
native 186 kD sequence, in which - compared to said native sequence - one or more 
amino acids have been added, replaced or removed, preferably replaced or removed, 
more preferably removed, leading to loss of replicase activity; 

30 - an amino acid sequence corresponding to the amino acid sequence encoded by the 

native 57 kD sequence; 

- an amino acid sequence corresponding to the amino acid sequence encoded by the 
native 57 kD sequence, in which - compared to said native sequence - one or more 
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amino acids have been added, replaced or removed, preferably replaced or removed, 

more preferably removed; 
or any combination thereof, provided that the resulting protein or polypeptide shows no 
replicase activity, but is still capable - upon expression in a plant - to generate resistance 
5 against CGMMV in said plant. 

More in particular, the defective replicase may be a protein or polypeptide that 
comprises, and preferably consists of: 

- an amino acid sequence corresponding to a part or fragment of the amino acid 
sequence encoded by the native 129 kD sequence, or to a combination of two or more 

1 0 such parts or fragments; 

- an amino acid sequence corresponding to a part or fragment of the amino acid 
sequence encoded by the native 1 86 kD sequence, or to a combination of two or more 
such parts or fragments; or 

- an amino acid sequence corresponding to the amino acid sequence encoded by the 
15 native 57 kD sequence. 

such that the resulting protein or polypeptide shows no replicase activity, but is still capable - 
upon expression in a plant - to generate resistance against CGMMV in said plant. 

An amino acid sequence ''corresponding to a part or fragment of the amino acid 
sequence encoded by the native 186 kD sequence, or to a combination of two or more such 

20 parts or fragments" may for instance comprise: i) at least one part or fragment of the amino 
acid sequence encoded by the native 129 kD sequence combined with at least one part or 
fragment of the amino acid sequence encoded by the native 57 kD sequence (which 
combination of parts or fragments may or may not correspond to a contiguous amino acid 
sequence encoded by the native 186 kD sequence); ii) at least one part or fragment of the 

25 amino acid sequence encoded by the native 129 kD sequence combined with the full amino 
acid sequence encoded by the native 57 kD sequence, and/or iii) at least one part or fragment 
of the amino acid sequence encoded by the full native 129 kD sequence combined with at 
least one part or fragment of the amino acid sequence encoded by the native 57 kD sequence. 
It is known, however, that expression in a plant of a nucleotide sequence encoding the 

30 full 129 kD sequence of the native replicase usually does not provide resistance against 
infection with CGMMV, but may even - upon infection of the plant - promote or facilitate 
multiplication of the virus. Therefore, in one embodiment, the invention does not comprise 
the expression in a plant of said replicase, nor the use of a polynucleotide sequence encoding 
such a replicase. 
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Even more preferably, the defective replicase is a protein or polypeptide that consists 

of: 

- an amino acid sequence corresponding to a part or fragment of the amino acid 
sequence encoded by the native 129 kD sequence, or a combination of two or more 

5 such parts or fragments; such that the resulting protein or polypeptide shows no 

replicase activity, but is still capable, upon expression in a plant, to generate resistance 
against CGMMV in said plant; or 

- an amino acid sequence corresponding to the amino acid sequence encoded by the 
native 57 kD sequence. 

1 0 Any such parts or fragments may also contain one or more fiirther amino acid 

substitutions, insertions or deletions compared to the native sequence, but this is not preferred. 

Most preferably, the defective replicase is a so-called "truncated replicase", i.e. an 
amino acid sequence corresponding to the amino acid sequence encoded either by the 129 kD 
sequence and/or by the 186 kD sequence, from which - compared to the native amino acid 

1 5 sequence - one or more amino acid residues are lacking at the carboxyl-terminus, such that the 
resulting protein or polypeptide shows no replicase activity, but is still capable - upon 
expression in a plant - to generate resistance against CGMMV in said plant. (In case of a 
truncated replicase based upon the 1 86 kD sequence, this usually means that the resulting 
protein will contain the full amino acid sequence of the 129 kD sequence, as well as part of 

20 the amino acid sequence of the 57 kD sequence (i.e. that is contiguous to the 129 kD 

sequence in the amino acid sequence encoded by native 186 kD sequence), with one or more 
amino acids lacking at the carboxy-terminus of the 57 kD part, although the invention in its 
broadest sense is not limited thereto). 

The polynucleotide sequence that encodes such a truncated replicase may either 

25 comprise, or preferably consist of, the frill native 129 kD sequence or 186 kD sequence, 

respectively, in which a stopcodon has been infroduced at a desired site, or a polynucleotide 
sequence from which - compared to the frill native 129 kD sequence or 186 kD sequence, 
respectively - one or more codons coding for the carboxy-terminal amino acid residues have 
been removed, i.e. starting from the 3 'end of the native sequence(s). 

30 As mentioned below, preferably a stopcodon is introduced in to the native sequence, in 

particular in the so-called ODD motiv or in the P-loop. Examples thereof are the 
polynucleotide sequences comprised in the vectors shown in Figures 3-8, and as described in 
the Experimental Part. 
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Again, any such truncated replicase may also contain one or more amino acid 
substitutions, insertions or deletions compared to the native sequence, but this is not preferred. 

As mentioned above, (the polynucleotide sequence encoding) the defective replicase is 
such that - after expression in a plant or plant cell - it is still capable of generating resistance 
5 against CGMMV in said plant. Usually, this means that the defective replicase will have at 
least one biological fiinction that allows the defective replicase to protect the plant against 
CGMMV infection, such as for example down-regulation of viral replication or interference 
with the replication of the wild-type CGMMV, for instance by competing with wild-type 
virus for the replication machinery in the plant (cell). It will be clear that in order to achieve 

10 such a biological function, the defective replicase must usually have a certain minimal level 
of amino acid similarity with the amino acid sequences encoded by the native 129 kD, 186 kD 
and/or 57 kD sequences. In so far as the defective replicase is similar to the corresponding 
native amino acid sequence, this may be because it contains - on the corresponding amino 
acid positions - the same amino acid residues as the native amino acid sequence, or amino 

15 acid residues comparable thereto. The latter will usually comprise so-called ''conservative" 
amino acid substitutions, for instance involving replacing a given acidic or basic amino acid 
residue by another acidic or basic amino acid residu. 

However, there will also be differences in amino acid sequence between the defective 
replicase and the native replicase (i.e. the 129 kD, 1 86 kD or 57 kD protein), such that the 

20 defective replicase will no longer provide replicase activity. The skilled person will be able to 
select appropriate alterations to the amino acid sequence of the native replicase. As will be 
clear to the skilled person, a single (amino acid or nucleotide) alteration may be sufficient, or 
two or more such alterations may be required, dependant upon the position and nature of the 
alteration(s) compared to the amino acid sequence of the native replicase. 

25 Whether a given polynucleotide sequence encodes a defective replicase according to 

the invention - or at least is capable of protecting a plant against infection with CGMMV - 
can simply be tested by transforming a plant, plant cell or plant material with a construct 
containing said polynucleotide sequence, and then exposing the plant, plant cell, plant 
material, and/or a mature plant generated therefrom, to CGMMV under conditions such that 

30 infection may occur. It can then be easily determined whether the polynucleotide 

sequence/construct is capable of protecting the plant, i.e. by suitably determining the presence 
of the virus, or simply by the presence or absence of symptoms of CGMMV-infection. 

In general, as a minimum, when the defective replicase contains any amino acid 
substitutions or insertions, it will have an amino acid homology (i.e. identity on corresponding 
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position) with the corresponding native replicase protein of at least 80%, preferably at least 
90%, more preferably at least 95%, with amino acid deletions not being taken into account, 
and a single amino acid insertion being counted as a single alteration. 

In general, as a minimum, when the defective replicase contains one or more amino 
5 acid deletions, it will usually contain at least 30%, preferably at least 50%, more preferably at 
least 70%, and usually 80-90%, and may even contain as much as 95-99%, of the amino acid 
sequence of the corresponding native replicase protein, with any amino acid insertions or 
substitutions not being taken into account. 

A truncated replicase based upon the 129 kD sequence will usually contain at least 

10 50%. preferably at least 70%, and may contain as much as 80-95%, of the amino acid 

sequence of the native replicase. A truncated replicase based upon the 1 86 kD sequence may 
contain the full 129 kD protein followed by one or more amino acids from the 57 kD 
sequence, and usually contains the full 129 kD sequence followed by 1-95%, preferably 5- 
50%, of the 57 kD sequence. 

15 The differences in amino acid sequence mentioned above can be differences compared 

to any of the amino acid sequences given in SEQ ID's 2, 4, 6 and/or 18, 20, 22, and/or 
compared to any naturally occuring variant of these amino acid sequences. These differences 
are at least such that the resulting protein does not correspond to a naturally occuring/native 
protein (including those given in SEQ ID's 2, 4, 6 and/or 18, 20, 22). 

20 The polynucleotide sequences used in the invention are such that they encode the 

above defective replicases. For this purpose, they may contain the same codons as in the 
corresponding positions on the native 129 kD, 186 kD and/or 57 kD sequence, or codons 
equivalent thereto due to the degeneracy of the genetic code. 

The polynucleotide sequence encoding the defective replicase can be provided in a 

25 manner known per se, for instance starting from the known sequence of the native 129 kD, 57 
kD and/or 186 kD sequences, and/or from a nucleic acid that encodes said sequences. Usually, 
this will involve introducing one or more deletions, substitutions and/or insertions of one or 
more nucleotides, or even of one or more codons into, or compared to, the native sequence. 
Such deletions, substitutions and/or insertions will be collectively referred to hereinbelow as 

30 ''alterations". 

Accordingly, the polynucleotide sequence encoding the defective replicase may be a 
sequence that contains one or more such alterations compared to any of the nucleotide 

sequences given in SEQ ID's 1,3,5 and/or 17, 19, 21, and/or compared to any naturally 
occuring variant of these nucleotide sequences (including DNA sequences corresponding to 
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the RNA sequences as present in the virus). These differences are at least such that the protein 
encoded by the polynucleotide sequence does not correspond to a naturally occuring/native 
protein (including those given in SEQ ID's 2, 4, 6 and/or 16, 18, 22). 

Furthermore, besides the alterations mentioned above, and compared to nucleotide 
5 sequences given in SEQ ID's 1, 3, 5 and/or 17, 19, 21 and/or compared to any naturally 

occuring variant of these nucleic acid sequences (including DNA sequences corresponding to 
the RNA sequences as present in the virus), the polynucleotide sequences may further contain 
one or more alterations that lead to a codon that encodes the same amino acid as the codon 
given for the corresponding position in SEQ ID's 1, 3, 5 and/or 17, 19, 21, and this may even 

10 lead to a fully or totally artificial and/or synthetic sequence. Also, compared to nucleotide 
sequences given in SEQ ID's 1, 3. 5 and/or 17, 19, 21 and/or compared to any naturally 
occuring variant of these nucleic acid sequences (including DNA sequences corresponding to 
the RNA sequences as present in the virus), the polynucleotide sequences may further contain 
one or more alterations that lead to a conservative amino acid substitution, i.e. as mentioned 

15 above. 

Providing a polynucleotide sequence that contains the desired alterations will be 
within the skill of the artisan and can involve techniques such as nucleic acid synthesis using 
an automated nucleic acid synthesis technique; introduction of (point)mutations into a nucleic 
acid that comprises the native 129 kD, 57 kD, and/or 186 kD sequences; and/or using or 

20 suitably combining parts or fragments of the 129 kD, 57 kD and/or 1 86 kD sequences, or any 
combination thereof Also, in providing such a polynucleotide sequence, the skilled person 
may take into account the degeneracy of the genetic code and/or conservative amino acid 
substitutions, as mentioned above. 

In order to provide a polynucleotide sequence that encodes a truncated replicase as 

25 defined above, a technique involving the introduction of a stopcodon into the native sequence 
is particularly preferred. 

A particularly preferred technique of introducing the above alterations - including 
stopcodons - involves the use of a PGR reaction, in which the desired alterations are 
introduced into the amplified sequence(s) by the use of modified primers, i.e. primers that 

30 contain a suitable "mismatch" compared to the template sequence, leading to the desired 

alteration in the amplified sequence. This PCR-based technique may also be used to introduce 
one or more restriction sites into the amplified sequence in order to facilitate the cloning of 
the amplification products into the desired transformation vectors. 
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As further described in the Experimental Part, this may involve a single PCR-reaction, 
but may also involve two or more PCR reactions, each leading to a part of intended final 
sequence encoding the defective replicase in which the primers (e.g. with the desired 
alteration) form the ends of the fragments. These fragments may then be combined, for 
5 instance to provide a polynucleotide sequence that comprises a combination of such 
fragments, and/or to reconstitute the full 129 kD, 57 kD and/or 186 kD sequence, now 
containing the desired alteration compared to the native sequence, such as a stopcodon. 

The PCR-reactions and the further steps following amplification, such as 
combining/joining the amplified sequences, can be carried out in a manner known per se, for 
10 instance as described in the Experimental Part and/or using the techniques described in US-A- 
4,683,202; Saiki et al., Science 239 (1988), 487-491 or PCR Protocols, 1990, Academic 
Press, San Diego, California, USA. 

As the template for the PCR-reaction, a nucleic sequence encoding the native 129 kD, 
57 kD and/or 1 86 kD sequence can be used, such as a cDNA derived from the native RNA 
1 5 sequence, or a plasmid containing such a sequence, including those described in the 

Experimental Part. The template used may itself already contain one or more alterations, 
compared to the corresponding native sequence. 

As mentioned above, a preferred alteration involves the introduction of a stopcodon 
into the native 129 kD or 1 86 kD sequence, such that - upon transformation into a plant - the 
20 polynucleotide sequence thus obtained causes expression of a truncated replicase. In 

particular, such a stopcodon may be introduced into a sequence corresponding to the native 
129 kD sequence, more in particular to that part of the native sequence that corresponds to the 
so-called GDD-motiv or to the so-called P-loop. 

The polynucleotide sequence encoding the defective replicase is preferably in the form 
25 of - e.g. forms part of and/or is incorporated within - a genetic construct. The genetic 

construct is preferably a construct suitable for the transformation of a plant, plant cell and/or 
plant material, such as a plasmid, cosmid or vector, including co-integration vectors or binary 
vectors. The genetic construct may be DNA or RNA, and is preferably dsDNA. 

Such a construct may fiirther contain all known elements for genetic constructs, and in 
30 particular for genetic constructs intended for the transformation of plants, as long as the 
presence thereof does not interfere with the CGMMV resistance to be provided by the 
polynucleotide sequence encoding the defective replicase. Some non-limiting examples of 
such elements include leader sequences, terminators, enhancers, integration factors, selection 
markers, reporter genes, etc., and suitable elements will be clear to the skilled person. 
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These further elements may or may not be derived from plants, and may or may not be 
homologous to the plant that is to be transformed with the construct of the invention 
(hereinbelow referred to as the ''target planf"). For instance, the further elements may also 
have been derived from micro-organisms, viruses, etc., and may also be elements that are 
5 natively associated with the CGMMV sequence, such as the native CGMMV leader sequence 
(5'-UTR sequence). 

The nucleotide sequences encoding these further elements may have been isolated 
and/or derived from a naturally occuring source - for instance as cDNA - and/or from known 
available sources (such as available piasmids, etc.), and/or may have been provided 

1 0 synthetically using known DN A synthesis techniques. 

For instance, a construct of the invention will usually contain a suitable promoter 
operatively linked to the polynucleotide sequence encoding the defective replicase, e.g. such 
that it is capable of directing the expression of the polynucleotide sequence. Suitable 
promoters can be chosen from all known constitutive, inducible, tissue specific or other 

15 promoters that can direct expression of a desired nucleotide sequence in a plant and/or in part 
of a plant, including specific tissues and/or individual cells of the plant. In particular, 
promoters are used that are suitable for use in species of the Cucurbitaceae family, such as 
cucumber. 

A specifically preferred promoter is the plastocyanine-promoter. Use of the 35S 
20 promoter is less preferred, as it may be less reliable in cucumber. 

The terminator can be any terminator that is effective in plants. A particularly 
preferred terminator is the nos-3' terminator. 

The selection marker can be any gene that can be used to select - under suitable 
conditions such as the use of a suitable selection medium- plants, plant material and plant 
25 cells that contain -e.g. as the result of a succesful transformation - the genetic construct 

containing the marker. A particularly preferred selection marker is the nptll-gsne, which can 
be selected for using kanamycin. 

The construct of the invention further preferably contains a leader sequence. Any 
suitable leader sequence, including those of viral origin, can be used. Preferably, a leader 
30 sequence essentially identical to the 5 'untranslated (5'-UTR) region of the CGMMV genome 
is used. This may be derived from the viral RNA, or may be provided synthetically, e.g. as 
described in the Experimental Part. 

Although not preferred, the invention also encompasses constructs that encode a 
fusion of a defective replicase as mentioned above, and at least one further amino acid 
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sequence, such as a protein or polypeptide, or a part or fragment thereof. Preferably, 
expression of a defective replicase as (part of) such a fusion does not detract from the desired 
biological activity (i.e. protection against infection with CGMMV). 

The construct of the invention can be provided in a maimer known per se, which 
5 generally involves techniques such as restricting and linking nucleic acids/nucleic acid 
sequences, for which reference is made to the standard handbooks, such as Sambrook et al, 
"Molecular Cloning: A Laboratory Manual" ( 2nd.ed.), Vols. 1-3, Cold Spring Harbor 
Laboratory (1989) of F. Ausubel et al, eds., "Current protocols in molecular biology", Green 
Publishing and Wiley Interscience, New York (1987). 

10 According to one embodiment, the genetic construct is preferably (also) in a form that 

can be stably maintained or inherited in a micro-organism, in particular a bacterium, more in 
particular a bacterium that can be used to transform a plant or plant material, such as 
Agrobacterium. In a further aspect, the invention also relates to such a micro-organism, in 
particular a bacterium, more in particular a bacterium that can be used to transform a plant, 

1 5 such as Agrobacterium, that contains a genetic construct according to the invention. 

The genetic construct can be transformed into the target plant, plant cell or plant 
material by any suitable transformation technique known per se, including transformation 
with Agrobacterium, transformation with "denuded" DNA, for instance through particle 
bombardment or transformation of protoplasts through electroporation or treatment with PEG. 

20 Examples of suitable vectors systems for use with Agrobacterium are for instance 

binary vectors such as pBI121 and derivatives thereof; co-integration vectors such as 
pGVlSOO and derivatives of pBR322. Suitable systems for transformation with denuded DNA 
include £.co/z- vectors with high copy number, such as pUC -vectors and pBluescript II (SK+) 
vectors. 

25 Upon transformation, the construct may for instance be incorporated into the genomic 

DNA of the plant, or it may be maintained/inherited independantly in the plant (cell). 

In a further aspect, the invention therefore comprises a method in which a plant, plant 
cell or plant material is transformed with a genetic construct as described above. 

This method may also comprise cultivating the transformed plant cell or plant material 
30 into a mature plant, and may also comprise sexually or asexually reproducing or multiplying 
the transformed plant (and/or the mature plant obtained from the transformed plant cell or 
plant material). 

The invention therefore also relates to a plant, plant cell or plant material, that has 
been transformed with - or more generally contains- a genetic construct as described above. 
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Preferably, such a plant, plant cell or such plant material is resistant against infection with 
CGMMV as described herein. 

The invention furthermore relates to cultivation material such as seed, tubers, roots, 
stalks, seedlings etc. for such a plant, as well as descendants of such a plant, obtained through 
5 sexual or asexual reproduction techniques. Such cultivation material and/or descendants most 
preferably still contain or have inherited the genetic construct of the invention, and more 
preferably also are resistant against infection v^th CGMMV as described herein. 

The invention will now be illustrated by means of the following non-limiting 
Experimental Part and by means of the Figures, in which: 
10 - Figure 1 is a schematic representation of the genome of CGMMV; 

Figure 2 gives a phylogenetic tree of CGMMV-coat protein (cp) for CGMMV-SH and 
the ten European isolates, using the method of J. Hein with weighted residue table. 
- Figures 3 - 8 show examples of some preferred genetic constucts of the invention, i.e 
those listed in Table 6 below. 
1 5 Also, in the Experimental Part hereinbelow, enzymes, khs, etc. were usually used 

according to the instructions of the manufacturer and/or using well-established protocols, 
unless indicated otherwise. 

Experimental Part 

20 

Example I: Cloning of the coat protein genes of 10 CGMMV-isolates 
1. Collecting CGMMV isolates 

To make use of coat protein-mediated protection (CPMP) strategy against CGMMV, it 
is necessary to clone the coat protein cistrons of the isolates, that are economically important. 

25 As the only sequence information available for CGMMV is derived from watermelon strains 
from the Far East, it was first decided to collect CGMMV isolates of important cucumber 
culture areas in Europe and the Mediterannean area. Table 1 lists the isolates collected from 
various geographical areas. All isolates were propagated on cucumber, and infected leaf 
material was stored at -80°C. The symptoms obtained after infection of cucumber cv. Hokus 

30 are listed in Table 1 . 



2. Design of PCR primers 

The possibility of sequence divergence among the various collected isolates, and 
between the isolates and the published sequences of CGMMV-SH and CGMMV-W exists. In 
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order to identify nucleotide regions with a high degree of sequence conservation, that could 
serve as a basis of PCR primer design, an alignment study was carried out on corresponding 
sequences of CGMMV-SH, CGMMV-W and of some other related members of the 
tobamovirus group: Sunn-Hemp Mosaic Virus (SHMV, a variant of TMV) and Pepper Mild 
5 Mottle Virus (PMMV). For this purpose, a region of 800 nucleotides just 5' of the coat 
protein cistron and a region of 170 nucleotides forming the far 3' of the viral genome were 
compared. In this sequence alignment, regions with sufficient sequence homology among all 
compared viruses were identified. Based on these sequences, sets of PCR primers were 
designed, which are listed in Table 2. 

10 

Table 2. Design of primers for the RT-PCR amplification of coat protein sequences of 
CGMMV-isolates. 



primers 


sequence 


position on CGMMV- 
SH sequence 


5' primers 






97G01 


AGGTGTCAGTGGAGAACTCATTGA 


5004 


97G02 


GGCGTTGTGGTTTGTGG 


5210 


97G03 


CTGTAGGGGTGGTGCTACTGT 


5248 


3' primer 






97G18 


GCCCATAGAAACTTCAACGTC 


6370 



15 3. AmpUncation of the coat protein regions 

From leaf material of cucumber plants infected with each of the 1 1 isolates described 
in Tabel 1, a total RNA extraction was prepared. Using each of the 5' primers listed in Table 
2 in combination with 3' primer 97G18, reverse transcription of RNA and PCR amplification 
of cDNA with an annealing temperature of 55°C was established using a kit manufactured by 
20 Perkin Elmer Cetus. Especially in the reactions with the 5' primer 97G03 amplification 
products of the correct size were obtained for each of the 1 1 RNA samples. The PCR 
amplification products were directly cloned in T/A cloning vector pCR2.1 and introduced in 
E.coli strain INVaF'. For each of the RNA samples, the correct size of the cloned product 
(1.12 kb) was verified, and the clones were stored at— 80°C. The amplification products of 
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CGMMV-isolates 1 to 10 cloned in pCR2.1 were designated pKG4301 to pKG4310, and the 
one of CGMMV-SH cloned in pCR2.1 was designated pKG43 1 1 . 



4. Nucleotide sequence analysis of the coat protein cistrons 

5 The sequences of the complete inserts of the plasmids pKG4301 to pKG43 1 0 were 

determined by reading in both directions using ml 3 forward en ml 3 reverse sequencing 
primers. The sequence of the insert of pKG43 1 1 was already known, as this plasmid contains 
a cDNA fragment of CGMMV-SH. 

Sequence analysis confirmed, that in each case indeed the correct cDNA fragment of 

10 CGMMV had been obtained and cloned. With one exception, each amplified and cloned 
cDNA fragment consisted of 11 23 base pairs, containing the CGMMV coat protein cistron 
and a large part of the CGMMV movement protein cistron. 

The cloned sequences of all collected European isolates (isolates 1 to 10) are 
approximately 97% homologous among each other, but differ on average by 10% from the 

1 5 published sequence of CGMMV-SH. Comparison of each individual sequence revealed, that 
isolates 1 and 2, both from Eastern Europe, are extremely alike. The same very high degree of 
identity was found between both isolates from cucumber greenhouses in the Netherlands 
(isolates 4 and 5) and between both isolates obtained from the Almeria area in Spain (isolates 
9 and 10). None of the cDNA sequences was 1 00% identical to any of the other ones, but the 

20 differences in sequence are no more than a few nucleotides, and sometimes only one 

nucleotide in the coding region of the coat protein cistron. The Japanse isolate CGMMV-SH 
is clearly different from any of the European isolates. 

5. Coat protein amino acid sequence analysis 

25 Based on the nucleotide sequences of the Open Reading Frames (ORP) of the coat 

protein cistrons of the 10 isolates, the amino acid sequence could be deduced. In each of the 
analyzed sequences, the ORE consisted of a region of 486 nucleotides, coding for a protein of 
161 amino acid residues. The predicted molecular mass of this protein is 17.3 kD, 
corresponding to earlier published results. The homology among the predicted protein 

30 sequences of the various isolates is as high as 98.1%. The only deviations are found for amino 
acid residue 19 (usually valine), residue 65 (mostly serine) and residue 84 (mostly leucine). 

The sequence of the coat protein of the Japanse isolate CGMMV-SH only differs by 1 
amino acid (residue 65) from the consensus sequence. 
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Example II: Cloning of the replicase gene of CGMMV 

1. Strategy for replicase-mediated protection 

5 By way of example, two approaches to replicase-mediated protection (RMP) against 

virus infections in plants were investigated. 

One approach makes use of defective replicase genes in the form of truncated Open 
Reading Frames (ORF), in which the sequence downstream from the ODD motif had been 
truncated or altered through mutation. 
10 The other approach makes use of the expression of the 'read-through' part of the 

replicase gene, i.e. the 57 kD sequence. It is thought that this ORF is not translated in the 
plant cell, but forms part of a larger 'read-through' ORF combining the coding regions of both 
the 129 kD replicase gene and the putative 57 kD protein gene, resulting in a protein of 189 
kD. However, expressing merely the 57 kD protein ORF in plant cells may result in a 
1 5 extremely strong resistance to infection by both virus particles and viral RNA, which also 
would be capable of resisting high temperatures, as well as high inoculum concentrations. 

For either of these approaches, either the full-length CGMMV replicase gene must be 
cloned, or both constituting parts must be cloned separately. 

20 

2. Design of primers 

Because of the high sequence homology of the coat protein genes of 1 1 CGMMV 
isolates it was assumed, that the sequences of the replicase genes of the various isolates would 
be also highly conserved. Based on the complete sequence of the CGMMV-SH isolate, 
25 primers were designed for the PCR amplification of the 57 kD ORF and of the 129 kD ORF 

(Tables 3 and 4). The primers were designed such, that they contain restriction sites for the 
future cloning of the amplification products. The 5' primers contain an A^col-site positioned 
such, that it will coincide with the ATG start codon of the amplified ORF. The 3' primers 
contain a .Sacl-site downstream from the stop codon. 



30 
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Table 3. Design of primers for the LR-RT-PCR amplification of the 57 kD replicase sequence 

ofCGMMV. 



primers 


sequence 


position on CGMMV-SH 
sequence 


5' primer 






98A88 


CCATGGAGAATTCGCTGTATGTC 
C 


3497 


3' primer 






98A86 


CGAGCTCTCGACTGACACCTTAC 


5001 



5 Table 4. Design of primers for the LR-RT-PCR amplification of the 129 kD replicase gene 
sequence of CGMMV. 



primers 


sequence 


position on CGMMV-SH 
sequence 


5' primers 






98A84 


CCATGGCAAACATTAATGAAC 


59 


98A85 


CAACCATGGCAAACATTAATG 


56 


3' primer 






98G63 


TAACAGGGAGGAAAATATTACG 





1 0 3. Long-Range Reverse Transcriptase Polymerase Chain Reactions 

From the known sequence of CGMMV-SH it was derived that the size of the 57 kD 
protein gene is 1.5 kb. Such a size is at the limit of the size range that can be amplified in a 
PCR with standard Taq polymerase. For the amplification of this cDNA fragment, and 
certainly for the amplification of the cDNA fragment for the 1 29 kD replicase gene, a 
1 5 different polymerase suitable for long range amplifications must be used. In these 
experiments, rTth DNA polymerase was used. 

For direct amplification of cDNA fragments from total RNA extractions a RT-PCR kit 
is normally employed, combining in one reaction the activity of the Reverse Transcriptase 
(RT), producing a single cDNA strand complementary to the RNA template strand beginning 



wo 01/09300 



24 



PCT/NLOO/00534 



at one primer annealed to the 3 ' end of the RNA molecules, and the activity of the 
Polymerase, amplifying the thus produced single stranded cDNA molecule in a normal PGR 
fashion. 

Because of the need to use long range polymerase, it was attempted to combine the RT 
5 with the long range polymerase to produce in one reaction large-size amplification products 
directly from total RNA extracts. This type of reaction was called a Long Range Reverse 
Transcriptase Polymerase Chain Reaction (LR-RT-PCR). 

4. LR-RT-PCR ampliflcation and cloning of the 57 kD protein gene 

10 Using the primers listed in Table 3 and the LR-RT-PCR described above, a specific 

1.5 kb amplification product was obtained from total RNA extracts of cucumber leaves 
infected with CGMMV-4. This isolate was chosen, as it originated from the Dutch cucumber 
greenhouse cultures, and would thus represent an economically important isolate. Because 
long range polymerases contain a 'proof reading' activity and do not leave A-additions on the 

15 ampliflcation products, as does the Taq polymerase normally employed in PGR, direct 

cloning of the amplification products in a TA vector accomodating the A-additions was not 
possible. Therefore, the amplification products were briefly treated with Taq polymerase, 
resulting in the addition of A-overhangs on the amplified DNA molecules. These molecules 
could then easily be cloned in the TA vector pCR2.1, and transformed to E.coli MCI 061. 

20 Clones with the correct insert size of 1 .5 kb were stored at -80°C and are known as pKG4321. 

5. Sequence analysis of the 57 kD protein gene 

The nucleotide sequence of the cloned insert of pKG4321 was determined by double- 
stranded sequencing using ml 3 forward en ml 3 reverse primers and subsequent primer 

25 walking steps. The ORF coding for a putative 57 kD protein gene (SEQ ID no 3) showed 90% 
homology at the nucleotide level to the corresponding sequence of the Japanse isolate 
CGMMV-SH (SEQ ID no 19). The predicted amino acid sequence (SEQ ID no 4) shows a 
98.2% homology to the one predicted by the CGMMV-SH sequence (SEQ ID no 20). 
The GDD motif characteristic for viral replicase genes resides at amino acid residues 364- 

30 366. 

6. LR-RT-PCR amplification and cloning of the 129 kD replicase gene 

Using the primers listed in Table 4 in a Long Range Reverse Transcriptase Polymerase 
Chain Reaction as described under 3 , one specific amplification product of 3.5 kb 
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representing the viral 129 kD replicase gene was obtained from total RNA of cucumber leaves 
infected with CGMMV isolate 4 (Table 1). Because long range polymerases contain a 'proof 
reading' activity and do not leave A-additions on the amplification products, as does the Taq 
polymerase normally employed in PGR, direct cloning of the amplification products in a TA 
5 vector accomodating the A-additions was not possible. Therefore, the amplification products 
were briefly treated with Taq polymerase, resulting in the addition of A-overh£ings on the 
amplified DNA molecules. These molecules could easily be cloned in the TA vector pCR2.1, 
and transformed to E.coli MC1061 . Clones with the correct insert size of 3.5 kb were stored at 
-80°C and are known as pKG4322. 

10 

7. Sequence analysis of the 129 kD protein gene 

The nucleotide sequence of the amplification product cloned in pKG4322 was 
determined by double-stranded sequencing using ml 3 forward en ml 3 reverse primers, and a 
primer walking strategy. The ORF coding for the 129 kD replicase gene (SEQ ID no 1) 

1 5 showed 88% homology at the nucleotide level to the corresponding sequence of the Japanse 
isolate CGMMV-SH (SEQ ID no 17). The ORF of the Dutch cucumber greenhouse isolate 
codes for a replicase protein of 11 44 amino acids, which is one amino acid in extra in 
comparison to the CGMMV-SH strain. The predicted amino acid sequence (SEQ ID no. 2) 
shows a 97.1% homology to the one predicted by the CGMMV-SH sequence (SEQ ID no 18). 

20 Two GDD motifs are found at amino acid residues 256-258 and 540-542. 

8. Site-directed mutagenesis of the 129 kD ORF 

As explained above, one approach to obtain RMP in plant cells was to make use of 
replicase genes truncated either in the GDD motif, or truncated in the P-loop of the helicase 

25 domain. In order to create gene expression cassettes carrying such truncated genes, a site- 
directed mutagenesis approach was followed to introduce stop codons at the required 
positions in the ORF. To this end, several parts of the 129 kD replicase ORF were re- 
amplified from pKG4322 as a template using specifically designed primers that included 
unique restriction sites for future re-assembling of the thus amplified products, as well as the 

30 required mutations in the form of stop codons (Table 5). These stop codons should ensure the 
proper truncation of the translation of the protein. Several stop codons were designed one 
after the other in the three reading frames in these primers, thus ensuring an effective 
translation-deficient mutation. 
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Table 5. Design of primers for the site-directed mutagenesis of tiie 129 kD replicase gene of 
CGMMV. 



primers 


sequence 




98L99 


GAGCTCGGATCCACTAGTAACGGC 




98L107 


TAGAGCTCTTGAAGCTAAGCAAATTCCG 




98L108 


TTCAAGAGCTCTAATCACCGAAGACAAAGGC 




98L102 


GAATTATATCGATTATCTATCGGC 




98L103 


GATAATCGATATAATTCTTCATCTGCC 




98L104 


AACTAGTAATTGATGATCTGTTCAAGAAG . 




98L105 


AATTACTAGTTTCCGGAAGCAAGCAGCTCAG 




98L106 


GCCCTCTAGATGCATGCTCGAG 





5 Using primers 98L103 and 98L104, a fragment from the downstream half of the 129 

kD gene from the GDD motif up to the C7al-site was amplified, while simultaneously stop 
codons were introduced at the site of the GDD motif. This fragment cloned in TA vector 
pCR2.1 was called pKG4325. 

Using primers 98L105 and 98L106, a fragment corresponding to the 5' half of the 129 
10 kD gene up to the GDD motif was amplified, while simultaneously a stop codon was 

introduced at the site of the GDD motif. This fragment cloned in T/A vector pCR2.1 was 
called pKG4326. 

Replacing an Xbal-Clal fragment of pKG4322 with the combined amplified products 
of pKG4325 and pKG4326 reconstitutes the full-length 129 kD replicase ORF of pKG4322 
15 with stop codons introduced at the site of the GDD motif This construct is named pKG 4329. 

Using primers 98L99 and 98L107, a fragment at the far downstream end of the 129 kD 
gene from the P-loop to the end of the ORF was amplified, while simultaneously stop codons 
were introduced at the site of the P-loop. This fragment cloned in T/A vector pCR2.1 was 
called pKG4327. 

20 Using primers 98L108 and 98L102, a fragment corresponding to a central part of the 

129 kD gene from the GDD motif up to the P-loop was amplified, while simultaneously a 
stop codon was introduced at the site of the P-loop. This fragment cloned in T/A vector 
pCR2.1 was called pKG4328. 
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Replacing an BamHl-CIal fragment of pKG4322 with the combined amplified 
products of pKG4327 and pKG4328 reconstitutes the full-length 129 kD replicase ORF of 
pKG4322 with stop codons introduced at the site of the P-loop. Thus construct is named 
pKG4330. 

5 

Example III: Transformation of cucumber 

1. Construction of a CGMMV-leader sequence 

For optimal expression and stability of the replicase gene transcripts in plant cells, it 

10 was thought necessary to add a sequence identical to the 5' untranslated (5' UTR) region of 
the CGMMV genome upstream from the ORF sequence in the plant expression cassette. 
Because the 5' UTR of viral genomes contain highly repetitive RNA, this sequence could not 
be obtained by RT-PCR amplification, as no specific primers could be designed. Instead, a 
synthetic region identical to the 5' UTR of CGMMV-SH was assembled fi-om the four 

15 oligonucleotide sequences: 

97G40 (CTAGAGTTTTAATTTTTATAATTAAACAAA), 
97G4 1 (TC AAAATTAAAAATATTAATTTGTTTGTTGTTGTTG), 
97G42 (CAACAACAACAACAACAAACAATTTTAAAACAACAC) and 
97G43 (TTGTTGTTTGTTAAAATTTTGTTGTGGTAC). 

20 These oligonucleotides were designed such, that outside the sequence corresponding 

to the 5' UTR they contain restriction sites fovXbal and Ncol, thus facilitating fiirther 
cloning. Adding the four oligonucleotides together will cause spontaneous assembling due to 
the design of extensive regions of overhang. Using these restriction sites, the assembled 
mixture was cloned in a plant expression vector containing an Arabidopsis thaliana 

25 plastocyanin promoter (Vorst et al. , 1 993) and a Agrobacterium tumefaciens nopaline 
synthase terminator sequence (Depicker et al, 1982) in a pUC19-derived plasmid. This 
expression cassette was called pKG1315. The complete expression cassette consisting of the 
plastocyanin promoter, the CGMMV leader sequence and the nos terminator was 
subsequently removed from pKG1315 using Hindlll and EcdRl as restriction enzymes, and 

30 recloned in the corresponding restriction sites of: 

1) an intermediate type Agrobacterium transformation vector for coi'ntegrate type vector 
systems containing an nptW selectable marker gene cassette to create pKG1575, and 

2) an intermediate type Agrobacterium transformation vector for coi'ntegrate type vector 
systems containing no selectable marker gene to create pKGl 110. 
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2. Construction of transformation vectors 

The three cloned and modified replicase constructs of pKG4321, pKG4329 and 
pKG4330 were isolated from the plasmids by restriction with BamEl (filled in with Klenow) 
and Ncol and ligated into the Sad (filled in with Klenow) and Ncol sites of each of the two 
5 transformation vectors pKG1575 and pKGl 1 10, resulting in a total of six transformation 
vectors, listed in Table 6. 

Table 6. List of six transformation vectors for the expression in plants of parts of the 
CGMMV-replicase gene. 

10 



vector 


vector type 


modified CGMMV-replicase gene 


pKG4331 


intermediate type with nptll 


57kDORF 


pKG4332 


intermediate type with nptll 


129 kD ORF with stopcodon in GDD 

motif 


pKG4333 


intermediate type with nptll 


129 kD ORF with stopcodon in P-loop 


pKG4334 


intermediate type 


57 kD ORF 


pKG4335 


intermediate type 


129 kD ORF with stopcodon in GDD 

motif 


pKG4336 


intermediate type 


129 kD ORF with stopcodon in P-loop 



3. Transformation of cucumber 

The intermediate type transformation vectors pKG433 1 and pKG4333 were 
introduced into Agrobacterium tumefaciens strain GV2260 by tri-parental mating. 

1 5 Transconjugants which had incorporated the intermediate type vector into their Ti-plasmids 
through homologous recombination were selected on the basis of streptomycin and 
spectinomycin resistance and analyzed for the correct insertion of the vector. 
Cucumber plants were transformed with these two strains of Agrobacterium, as well as with 
an Agrobacterium strain harboviring only the nptll selection marker, using published 

20 procedures. A number of transgenic cucumber plants were obtained. The plants were 

transferred to a greenhouse to flower and set seed. The seedlings germinating from these Rl 
seed were mechanically infected with CGMMV isolate 1-3 weeks post-moculation, the plants 
were scored for symptoms of virus infection, as described in the assay for tolerance to virus 
infection set out in under 4. below. 
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4. Assay for tolerance to virus infection 

The seedlings of transgenic cucumber germinating from these Rl seed were 
mechanically infected with CGMMV isolate 1. Fresh inoculum was prepared from a crude 
leaf extract of susceptible non-transgenic cucumber plemts cv. Hokus pre-infected with this 
5 same isolate 3 weeks previously. Seedlings of non-transformed cucumber plants were used as 
controls in the assay. During 21 days post-inoculation the appearance of viral symptoms was 
scored visually every 2 days. In this assay, individual plants are scored as being tolerant when 
they remain free of visible symptoms for at least 7 days, and preferably more than 14 days, 
and more preferably more than 21 days post-inoculation. 

10 Sixty-four independent transgenic lines were assayed, with 14 to 20 seedlings for each 

line. Control seedlings all became diseased within 9 days post-inoculation. A number of 
seedlings in seventeen of the transgenic lines showed clear absence of symptoms for a 
prolonged period of time, and remained free of symptoms after 21 days post-inoculation. Of 
some transgenic lines, the number of symptom-free plants coresponded to Mendelian 

15 segregation of a transgene present in a single locus. In one particular transgenic cucumber line 
4 out of 14 seedlings remained symptom-free during the assay period, which may indicate 
that the tolerant phenotype corresponds to the homozygous state of a transgene present in one 
single locus, although, as mentioned above, the invention is not limited to a specific 
mechanism. 

20 

Hereinabove, the invention has been described under the assumption that resistance 
against CGMMV is generated "at the protein level", i.e. that the "nucleotide sequence 
encoding a defective variant of the replicase gene of CGMMV" codes for a "defective 
replicase", the expression of which at cellular level generates the desired resistance against 

25 CGMMV. However, the invention is not limited to any explanation or mechanism, and is not 
particularly limited to the use of a particular type of nucleotide sequence (i.e. encoding a 
"defective replicase"). For instance, an alternative mechanism is based on the assumption that 
the resistance to CGMMV is generated at the RNA-level, e.g. downregulation of gene 
expression due to RNA sequence homology. The use of any nucleotide sequence that can be 

30 used to provide resistance against CGMMV and that at the same time does not provide 
replicase activity, is envisaged by the invention. 
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CLAIMS 

1 . Method for generating resistance in a plant or in a plant cell against infection with 
CGMMV, said method comprising at least the step of transforming said plant or plant cell 

5 with a polynucleotide sequence that 

i) upon (at least) transformation into a plant and transcription into RNA generates 
resistance against infection with CGMMV in said plant; and 

ii) upon (at least) transformation into a plant and transcription into RNA does not lead to 
generation of (any) replicase activity in said plant. 

10 

2. Method for providing a transgenic plant and/or a transgenic plant cell that is 
resistant against infection with CGMMV, comprising at least the step of transforming said 
plant or plant cell with a polynucleotide sequence that 

i) upon (at least) transformation into a plant and transcription into RNA generates 
15 resistance against infection with CGMMV in said plant; and 

ii) upon (at least) transformation into a plant and transcription into RNA does not lead to 
generation of (any) replicase activity in said plant. 

3. Method according to claim 1 and/or 2, in which the polynucleotide sequence is a 
20 nucleotide sequence that comprises, and preferably consists of: 

- a nucleotide sequence corresponding to the sequence given in SEQ ID no. 1 , to the 
sequence given in SEQ ID no. 17 or to the nucleotide sequence of a naturally occuring 
variant thereof, in which - compared to the sequence of SEQ ID no. 1 , SEQ ID no. 1 7 
and/or the naturally occuring variant thereof - one or more nucleotides have been 

25 added, replaced and/or removed; 

- a nucleotide sequence corresponding to the sequence given in SEQ ID no.5, to the 
sequence given in SEQ ID no.21, to the nucleotide sequence of a naturally occuring 
variant thereof, in which - compared to the sequence of SEQ ID no.5, SEQ ID no. 21 
and/or the nattirally occuring variant thereof - one or more nucleotides have been 

30 added, replaced and/or removed; 

- a nucleotide sequence corresponding to the sequence given in SEQ ID no. 3, to the 
sequence given in SEQ ID no. 19 or to the nucleotide sequence of a naturally occuring 
variant thereof; 
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- a nucleotide sequence corresponding to the sequence given in SEQ ID no. 3, to the 
sequence given in SEQ ID no. 19 or to the nucleotide sequence of a naturally occuring 
variant thereof, in which - compared to the sequence of SEQ ID no. 3, SEQ ID no. 19 
and/or the naturally occuring variant thereof - one or more nucleotides have been 
added, replaced and/or removed; 

such that said nucleotide sequence is capable, upon (at least) transformation into a plant and 
transcription into RNA, to confer to said plant resistance against infection with CGMMV, and 
such that said nucleotide sequence, upon (at least) transformation into a plant and 
transcription into RNA, is not capable of generating of (any) replicase activity in said plant. 

4. Method according to any of claims 1-3, in which the polynucleotide sequence 
encodes a polypeptide or protein that 

i) upon being expressed in a plant is capable of generating resistance against CGMMV 
in said plant; and 

ii) upon being expressed in a plant has no replicase activity. 

5. Method according to claim 4, in which the polynucleotide sequence encodes a 
protein or polypeptide that comprises, and preferably consists of: 

- an amino acid sequence corresponding to the sequence given in SEQ ID no. 2, to the 
sequence given in SEQ ID no. 18 or to the amino acid sequence of a naturally occuring 
variant thereof, in which - compared to the sequence of SEQ ID no.2, SEQ ID no. 18 
and/or the naturally occuring variant thereof - one or more amino acids have been 
added, replaced or removed, preferably replaced or removed, more preferably 
removed; 

- an amino acid sequence corresponding to the sequence given in SEQ ID no. 6, to the 
sequence given in SEQ ID no.22 or to the amino acid sequence of a naturally occuring 
variant thereof, in which - compared to the sequence of SEQ ID no. 6, SEQ ID no. 22 
and/or the naturally occuring variant thereof - one or more amino acids have been 
added, replaced or removed, preferably replaced or removed, more preferably 
removed 

- an amino acid sequence corresponding to the sequence given in SEQ ID no. 4, to the 
sequence given in SEQ ID no.20 or to the amino acid sequence of a naturally occuring 

variant thereof; 
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- an amino acid sequence corresponding to the sequence given in SEQ ID no.4, to the 
sequence given in SEQ ID no.20 or to the amino acid sequence of a naturally occuring 
variant thereof, in which - compared to the sequence of SEQ ID no.4, SEQ ID no. 20 
and/or the naturally occuring variant thereof - one or more amino acids have been 

5 added, replaced or removed, preferably replaced or removed, more preferably 

removed; 

or any combination thereof, provided that the resulting protein or polypeptide shows no 
replicase activity, but is still capable - upon expression in a plant - to generate resistance 
against CGMMV in said plant. 

10 

6. Method according to claim 4 or 5, in which the polynucleotide sequence encodes a 
protein or polypeptide that comprises, and preferably consists of : 

- an amino acid sequence corresponding to a part or fragment of the sequence given in 
SEQ ID no.2, to a part or fragment of the sequence given in SEQ ID no. 18 and/or to a 

1 5 part of fragment of the amino acid sequence of a naturally occuring variant thereof, or 

corresponding to a combination of two or more such parts or fragments; 

- an amino acid sequence corresponding to a part or fragment of the sequence given in 
SEQ ID no.6, to a part or fragment of the sequence given in SEQ ID no.22 and/or to a 
part of fragment of the amino acid sequence of a naturally occuring variant thereof, or 

20 corresponding to a combination of two or more such parts or fragments; ts; 

- an amino acid sequence corresponding to the sequence given in SEQ ID no.4, to the 
sequence given in SEQ ID no.20, and/or to the amino acid sequence of a naturally 
occuring variant thereof; 

such that the resulting protein or polypeptide shows no replicase activity, but is still capable - 
25 upon expression in a plant - to generate resistance against CGMMV in said plant. 

7. Method according to claim 4 or 5, in which the polynucleotide sequence encodes a 
protein or polypeptide that comprises, and preferably consists of : 

- an amino acid sequence corresponding to a truncated variant of the sequence given in 
30 SEQ ID no.2, to a truncated variant of the sequence given in SEQ ID no. 1 8 and/or to a 

truncated variant of the amino acid sequence of a naturally occuring variant thereof; 
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- an amino acid sequence corresponding to a truncated variant of the sequence given in 
SEQ ID no. 6, to a truncated variant of the sequence given in SEQ ID no. 22 and/or to a 
truncated variant of the amino acid sequence of a naturally occuring variant thereof; 

- an amino acid sequence corresponding to the sequence given in SEQ ID no. 4, to the 
5 sequence given in SEQ ID no 20, and/or to the amino acid sequence of a naturally 

occuring variant thereof; 
such that the resulting protein or polypeptide shows no replicase activity, but is still capable - 
upon expression in a plant - to generate resistance against CGMMV in said plant. 

10 8. Method according to claim 7, in which the polynucleotide sequence encodes a 

protein or polypeptide that comprises, and preferably consists of, the amino acid sequence 
given in SEQ ID no.2, the amino acid sequence given in SEQ ID no 18 and/or the amino acid 
sequence of a naturally occuring variant thereof, that has been truncated in the GDD-motif or 
in the P-loop. 

15 

9. Method according to any of the preceding claims, further comprising at least one 
step of cultivating the transformed plant cell into a mature plant 



10. Method according to any of the preceding claims, further comprising at least one 
20 step of sexually or asexually reproducing or multiplying the transformed plant and/or the 
mature plant obtained from the transformed plant cell of claim 9. 



1 1 . Method according to any of the preceding claims, in which the plant is a plant that 
is susceptible to infection with CGMMV, more preferably a plant belonging to the 

25 Cucurbitaceae family, such as melon (Cucumis melo), cucumber {C.sativus), watermelon 
{Citrullus vulgaris) and bottlegourd {Lagenaria siceraria) 

12. Genetic construct suitable for transforming a plant, said construct at least 
comprising nucleotide sequence that 

30 i) upon (at least) transformation into a plant and transcription into RNA generates 

resistance against infection with CGMMV in said plant; and 
ii) upon (at least) transformation into a plant and transcription into RNA does not lead to 

generation of (any) replicase activity in said plant; 
and optionally comprising further elements of genetic constructs known per se. 
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13. Genetic construct according to claim 12. at least comprising a nucleotide sequence 
that comprises, and preferably consists of: 

- a nucleotide sequence corresponding to the sequence given in SEQ ID no. 1 , to the 

5 sequence given in SEQ ID no. 1 7 or to the nucleotide sequence of a naturally occuring 

variant thereof in which - compared to the sequence of SEQ ID no.l, SEQ ID no. 17 
and/or the naturally occuring variant thereof - one or more nucleotides have been 
added, replaced and/or removed; 

- a nucleotide sequence corresponding to the sequence given in SEQ ID no.5, to the 
10 sequence given in SEQ ID no.21, to the nucleotide sequence of a naturally occuring 

variant thereof, in which - compared to the sequence of SEQ ID ho.5, SEQ ID no. 21 
and/or the naturally occuring variant thereof - one or more nucleotides have been 
added, replaced and/or removed; 

- a nucleotide sequence corresponding to the sequence given in SEQ ID no. 3, to the 

1 5 sequence given in SEQ ID no. 1 9 or to the nucleotide sequence of a naturally occuring 

variant thereof; 

- a nucleotide sequence corresponding to the sequence given in SEQ ID no.3, to the 
sequence given in SEQ ID no. 19 or to the nucleotide sequence of a naturally occuring 
variant thereof, in which - compared to the sequence of SEQ ID no.3, SEQ ID no. 19 

20 and/or the naturally occuring variant thereof - one or more nucleotides have been 

added, replaced and/or removed; 
such that said construct is capable, upon (at least) transformation into a plant and transcription 
into RNA, to confer to said plant resistance against infection with CGMMV, and such that 
said construct, upon (at least) transformation into a plant and transcription into RNA, is not 

25 capable of generating of (any) replicase activity in said plant. 

14. Genetic construct according to claim 12 or 13, at least comprising a nucleotide 
sequence that encodes a polypeptide or protein that 

i) upon being expressed in a plant is capable of generating resistance against CGMMV 
30 in said plant; and 

ii) upon being expressed in a plant has no replicase activity. 
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15. Genetic construct according to claim 14, in which the nucleotide sequence encodes 
a protein or polypeptide that comprises, and preferably consists of: 

- an amino acid sequence corresponding to the sequence given in SEQ ID no.2, to the 
sequence given in SEQ ID no. 18 or to the amino acid sequence of a naturally occuring 

5 variant thereof, in which - compared to the sequence of SEQ ID no.2, SEQ ID no. 1 8 

and/or the naturally occuring variant thereof - one or more ammo acids have been 
added, replaced or removed, preferably replaced or removed, more preferably 
removed; 

- an amino acid sequence corresponding to the sequence given in SEQ ID no. 6, to the 
10 sequence given in SEQ ID no.22 or to the amino acid sequence of a naturally occuring 

variant thereof, in which - compared to the sequence of SEQ ID no.6, SEQ ID no. 22 
and/or the naturally occuring variant thereof - one or more amino acids have been 
added, replaced or removed, preferably replaced or removed, more preferably 
removed 

15 - an amino acid sequence corresponding to the sequence given in SEQ ID no.4, to the 

sequence given in SEQ ID no.20 or to the amino acid sequence of a naturally occuring 
variant thereof; 

- an amino acid sequence corresponding to the sequence given in SEQ ID no.4, to the 
sequence given in SEQ ID no.20 or to the amino acid sequence of a naturally occuring 

20 variant thereof, in which - compared to the sequence of SEQ ID no.4, SEQ ID no. 20 

and/or the naturally occuring variant thereof - one or more amino acids have been 
added, replaced or removed, preferably replaced or removed, more preferably 
removed; 

or any combination thereof, provided that the resulting protein or polypeptide shows no 
25 replicase activity, but is still capable - upon expression in a plant - to generate resistance 
against CGMMV in said plant. 

16. Genetic construct according to claim 14 or 15, in which the nucleotide sequence 
encodes a protein or polypeptide that comprises, and preferably consists of : 

30 - an amino acid sequence corresponding to a part or fragment of the sequence given in 
SEQ ID no.2, to a part or fragment of the sequence given in SEQ ID no. 1 8 and/or to a 
part of fragment of the amino acid sequence of a naturally occuring variant thereof, or 
corresponding to a combination of two or more such parts or fragments; 
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- an amino acid sequence corresponding to a part or fragment of the sequence given in 
SEQ ID no.6, to a part or fragment of the sequence given in SEQ ID no.22 and/or to a 
part of fragment of the amino acid sequence of a naturally occuring variant thereof, or 
corresponding to a combination of two or more such parts or fragments; ts; 

5 - an amino acid sequence corresponding to the sequence given in SEQ ID no.4, to the 
sequence given in SEQ ID no.20, and/or to the amino acid sequence of a naturally 
occuring variant thereof; 
such that the resulting protein or polypeptide shows no replicase activity, but is still capable - 
upon expression in a plant - to generate resistance against CGMMV in said plant. 

10 

17. Genetic construct according to any of claims 14-16, in which the nucleotide 
sequence encodes a protein or polypeptide that comprises, and preferably consists of : 

- an amino acid sequence corresponding to a truncated variant of the sequence given in 
SEQ ID no.2, to a truncated variant of the sequence given in SEQ ID no. 1 8 and/or to a 

15 truncated variant of the amino acid sequence of a naturally occuring variant thereof; 

- an amino acid sequence corresponding to a truncated variant of the sequence given in 
SEQ ID no.6, to a truncated variant of the sequence given in SEQ ID no.22 and/or to a 
truncated variant of the amino acid sequence of a naturally occuring variant thereof; 

- an amino acid sequence corresponding to the sequence given in SEQ ID no.4, to the 
20 sequence given in SEQ ID no 20, and/or to the amino acid sequence of a naturally 

occuring variant thereof; 
such that the resulting protein or polypeptide shows no replicase activity, but is still capable - 
upon expression in a plant - to generate resistance against CGMMV in said plant. 

25 18. Genetic construct according to claim 1 7, in which the nucleotide sequence encodes 

a protein or polypeptide that comprises, and preferably consists of, the amino acid sequence 
given in SEQ ID no.2, the amino acid sequence given in SEQ ID no. 18 and/or the amino acid 
sequence of a naturally occuring variant thereof, that has been truncated in the GDD-motif or 
in the P-loop. 

30 

19. Genetic construct according to any of claims 12-18, in which the nucleotide 
sequence is under control of the plastocyanine-promoter. 
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20. Genetic construct according to any of claims 12-19, in which the nucleotide 
sequence is preceded by the native CGMMV leader (5'-UTR) sequence. 

21 . Genetic construct according to any of claims 12-20, in a form that can be stably 
5 maintained or inherited in a micro-organism, in particular a bacterium, more in particular a 

bacterium that can be used to transform a plant or plant material, such as Agrobacterium. 

22. Micro-organism, in particular bacterium, more in particular a bacterium that can be 
used to transform a plant, such as Agrobacterium, that contains a genetic construct according 

10 to any of claims 12-21, and in particular according to claim 21 . 

23. Transgenic plant or plant cell, obtainable or obtained by a method according to one 
of claims 2-1 1, or a descendant of such a plant. 

1 5 24. Plant, plant cell or plant material that has been transformed with genetic construct 

according to any of claims 12-21, or a descendant of such a plant. 

25. Plant according to claim 24 or 25, being a plant that is susceptible to infection with 
CGMMV, more preferably a plant belonging to the Cucurbitaceae family, such as melon 

20 {Cucumis meld), cucumber {C.sativus), watermelon {Citrullus vulgaris) and bottlegourd 
{Lagenaria sicerarid) 

26. Cultivation material such as seed, tubers, roots, stalks, seedlings for a plant 
according to claim 23, 24 or 25. 

25 
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SEQUENCE LISTING 
<110> Keygene N.V. 

<120> Method for generating CGMMV resistance in plants, 
genetic constructs for use in said method, and 
CGMMV-resistant plants obtained via said method 

<130> BO 42790 

<140> 
<141> 

<160> 44 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 3432 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: nucleotide 
sequence corresponding to 129 kD replicase of 
CGMMV isolate 4 . 



<400> 1 

atggcaaaca 

aatctcgtta 

catcaagata 

cttgtcaccg 

cattcccttg 

tatggttcac 

tcatatgtgc 

tacaacgaca 

cttccgactt 

tgtccagacg 

gtttcattac 

aaaaacgtca 

cccgtgggra 

tttcttttta 

attgtaatgc 

gttaagcgtg 

aaatctgtag 

attttccaag 

atacctaagt 

gtcgatgctg 

ttagtgtgga 

ggagtttccg 

tcgatattcc 

gtgatcgagg 

ggactaggtg 

gacgaactga 

atgtacgagg 

gatgatctgt 

gtggagaaat 

gtgatcgaag 

tctcctgaga 

atggacgtaa 

atttctccag 



ttaatgaaca 
gtcagctagc 
gacgcccaaa 
atgcgtatcc 

cgggaggttt 
cttgctttga 
attgctgcaa 
tgatcacaca 
tccagataga 
tttttcaaga 
attcgattta 
gggtctgtra 
atttaaatag 
gtgaggagtc 
gtacttattt 
tagacacttt 
ggcactattc 
ataaggccac 
ttgaactctc 
attttgtcca 
agaatgtcca 
taaaatctga 
ttctcgtgaa 
cgaggggttt 
attgcgtcta 
aagtattact 
cagatgcaaa 
tcaagaagat 
ttcaagaatt 
ctattttttc 
tgggtgcttc 
ctgaggatat 
aaatgtcgag 



aatcaacaat 
atcaaagagg 

aatgaacttt 
ggagrtttcg 
gaggcttctt 
tattggcggt 
tccgtgcctg 
acatgtacag 
tgctttcagg 
atgctcctat 
tgatatccct 
cgcagccttt 
tataggggct 
aactttgcat 
ccctgctgat 
tttttttagg 
gaagtcgaaa 
gttttctgtg 
gagatttctt 
taccattatt 
gtcttttgta 
atggaatgta 
ggttagaaag 
gcttcggagg 
tgatgctcta 
acctgaaccg 
aattgagaga 
tgacgaaata 
ctgtaaagaa 
acagaaggca 
cgttgcgtta 
ggaggaaata 
atgggctagt 



caacgtgatg 
gtgtatgacg 
tctcgtgtgg 
attagtttca 
gaattggaat 
aattacacgc 
gatcttaagg 
aggcacaaag 
aggtatgaag 
gattttggga 
tattcttcga 
catttctcgg 
caatttaggg 
tacactcata 
gataggttcg 
ttagttaggg 
tctgagtatt 
tggtttcccg 
tctggaaatg 
aatcacatta 
gaatctatac 
ccggtcgatc 
gtgcagattg 
ttcgctgata 
gttcaaaccg 
tttatgacct 
gagagtgtct 
aggaataatt 
ctgaatgtta 
ggggtaacag 
tccaataatt 
gtgttgatag 
atgaaatacg 



ctgctgctag 
aggccgttcg 
tcagtacaga 
ccgctaccaa 

ciCatigStig3t 

agcatttatt 
atgttgcgag 
gatctggtgg 
attcgcccgt 
gtggtaggga 
trgggccagc 
aggcgttgct 
tr gatggtga 
gtttggagaa 
tgtatattaa 
cagacacaca 
ttgcgttgaa 
aagcgaagcg 
tgaaagtctc 
gcacgtacga 
gctctagggt 
agcttactga 
agttaatgtc 
gtctcaaatc 
grtggtttga 
tttcagatta 
ctgagctgct 
acagcggagt 
atcctatgct 
tcacgggcct 
ctgtagatac 
cagacaagaa 
gcaataataa 



cgggagaaat 
ctcgttagat 
gcacaccagg 
gaattcagtt 
gcaggtgcct 
taaaggtaga 
gaatgtgatg 
gtgtagacct 
cgcagtcacc 
taatcatgcg 
tcttcatagg 
cctaggttcg 
cgatgtgcat 
tattaagttg 
ggagtttatg 
tatgctccat 
cacccctccg 
gaaggtgttg 
taggatgctt 
taacaaggcc 
aattgtaaac 
tatctcattc 
tgataaggtt 
cgccgtagaa 
cacctctagc 
tctcgaaggg 
tgcttccgga 
tgaatttgat 
aatcggtcat 
aggcacgctc 
atgtgatgat 
tcactcttat 
cggggcctta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 
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grtgagtaca 
gctgttttac 
gaggaggacg 
aagaagacta 
tatatcgatt 
aggagtgact 
aagtgcaagt 
agtgatgata 
gacaagaaaa 
cgtagtttag 
gtcgatgggg 
actgaccttg 
gccctacaca 
atgaataaga 
acggggttgc 
gatgctaagc 
agaactttga 
gtcactagtt 
cattccgtga 
atcaaaggga 
tacaatgatg 
gtgcgtgcaa 
gctttaacgc 
agcataatag 
gtgcctacca 



aggtcggaac 
cgttgtcggg 
acttgaggtt 
tcactccagt 
atctatcggc 
ggaatggtac 
ggttactgtt 
ccactcgtat 
attggaagcg 
aagtcttaaa 
ttccgggctg 
tgttgactcc 
agtcacctgt 
aggtttttaa 
tcaactttgc 
aaattccgtt 
tagttgataa 
ttcttaatac 
aggcaataaa 
agatcataac 
tgaatactgt 
caccgactcc 
ggcacaccaa 
cggatgtgga 
aa 



ctcgatgact 
aatctgtgta 
atcaaacatg 
tgtttatact 
ttctctgggt 
cgaggagagc 
gccatcggag 
aatctttctg 
gttcgctgtc 
taaggaggcc 
tggaaaaacc 
cggaagggaa 
agctactagt 
atttgacgcc 
gttgaagatt 
tattaataga 
tgtagagcgt 
tatttataaa 
ggtttctggg 
gtttactcag 
gcatgagatt 
aataggtctg 
ggcaatggtg 
aaaggtcgat 



ttacctgcca 
agaaagcccc 
aatttcttta 
gggaccattc 
tctacgcttg 
atgcaaactt 
aagaaacacg 
tcgtatgacg 
tgttctgata 
acagtcgatc 
gctgaaatta 
gcggctgcta 
gataacgtta 
gtctacgtag 
tcgggttgta 
gttatgaatt 
aggtatatta 
gctgcggttt 
gctggtattc 
tctgataaac 
cagggggaga 
attgcccgag 
tattataccg 
cagtcgattt 



cctgggcaga 
aattttcaaa 
aggtgagtga 
gagagaggca 
gtaatcttga 
ttggattgta 
cctgggctgt 
aatccggttc 
ccaaagttta 
ctggggtgta 
tagcgagggt 
tgatcaggcg 
ggacttttga 
atgaaggtct 
aaaaggcctt 
ttgattatcc 
cccataggtg 
ctaccactag 
tgaggcccga 
aatccttgat 
cctttgagga 
attcaccaca 
ttgtgttcga 
tgactatgtt 



aaagggtaag 
gccactcgat 
tctgaagttg 
gatgaagaat 
gagaattgtt 
cgattgcgag 
agtcctggcg 
tcctataatt 
tagtgtaatt 
tataacttta 
caattggaaa 
aagagcctgt 
ttctttcgta 
tatggtccac 
tgtcttcggt 
taaggaatta 
tcctagagat 
tccggttgta 
gttgacgaag 
caagagtggg 
gacggcggtt 
cgtgtragtg 
tgccgtaaca 
tgctactact 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3432 



<210> 2 
<211> 1144 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amino acid 
sequence corresponding to 129 kD replicase of 
CGMMV isolate 4 

<400> 2 

Met Ala Asn lie Asn Glu Gin He Asn Asn Gin Arg Asp Ala Ala Ala 
15 10 15 

Ser Gly Arg Asn Asn Leu Val Ser Gin Leu Ala Ser Lys Arg Val Tyr 
20 25 30 

Asp Glu Ala Val Arg Ser Leu Asp His Gin Asp Arg Arg Pro Lys Met 
35 40 45 

Asn Phe Ser Arg Val Val Ser Thr Glu His Thr Arg Leu Val Thr Asp 
50 55 60 

Ala Tyr Pro Glu Phe Ser He Ser Phe Thr Ala Thr Lys Asn Ser Val 
65 70 75 80 

His Ser Leu Ala Gly Gly Leu Arg Leu Leu Glu Leu Glu Tyr Met Met 
85 90 95 

Met Gin Val Pro Tyr Gly Ser Pro Cys Phe Asp He Gly Gly Asn Tyr 
100 105 110 



Thr Gin His Leu Phe Lys Gly Arg Ser Tyr Val His Cys Cys Asn Pro 
115 120 125 
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Cys Leu Asp Leu 

130 

lie Thr Gin His 

145 

Leu Pro Thr Phe 



Val Ala Val Thr 
180 

Gly Ser Gly Arg 
195 

lie Pro Tyr Ser 
210 

Val Cys Tyr Ala 
225 

Pro Val Gly Asn 



Asp Asp Val His 
260 

His Ser Leu Glu 
275 

Ala Asp Asp Arg 
290 

Asp Thr Phe Phe 
305 

Lys Ser Val Gly 



Asn Thr Pro Pro 
340 

Pro Glu Ala Lys 
355 

Phe Leu Ser Gly 
370 

Phe Val His Thr 
385 

Leu Val Trp Lys 



Val He Val Asn 
420 

Asp Gin Leu Thr 

435 



Lys Asp Val Ala 

135 

Val Gin Arg His 
150 

Gin He Asp Ala 
165 

Cys Pro Asp Val 



Asp Asn His Ala 
200 

Ser He Gly Pro 
215 

Ala Phe His Phe 
230 

Leu Asn Ser He 

245 

Phe Leu Phe Ser 



Asn He Lys Leu 
280 

Phe Val Tyr He 
295 

Phe Arg Leu Val 
310 

His Tyr Ser Lys 
325 

He Phe Gin Asp 



Arg Lys Val Leu 

360 

Asn Val Lys Val 
375 

He He Asn His 
390 

Asn Val Gin Ser 
405 

Gly Val Ser Val 



Asp He Ser Phe 
440 



Arg Asn Val Met 

140 

Lys Gly Ser Gly 
155 

Phe Arg Arg Tyr 
170 

Phe Gin Glu Cys 
185 

Val Ser Leu His 



Ala Leu His Arg 
220 

Ser Glu Ala Leu 
235 

Gly Ala Gin Phe 

250 

Glu Glu Ser Thr 
265 

He Val Met Arg 



Lys Glu Phe Met 
300 

Arg Ala Asp Thr 
315 

Ser Lys Ser Glu 
330 

Lys Ala Thr Phe 
345 

He Pro Lys Phe 



Ser Arg Met Leu 
380 

He Ser Thr Tyr 
395 

Phe Val Glu Ser 
410 

Lys Ser Glu Trp 
425 

Ser He Phe Leu 



Tyr Asn Asp Met 



Gly Cys Arg Pro 
160 

Glu Asp Ser Pro 
175 

Ser Tyr Asp Phe 
190 

Ser He Tyr Asp 
205 

Lys Asn Val Arg 



Leu Leu Gly Ser 
240 

Arg Val Asp Gly 
255 

Leu His Tyr Thr 
270 

Thr Tyr Phe Pro 
285 

Val Lys Arg Val 



His Met Leu His 
320 

Tyr Phe Ala Leu 
335 

Ser Val Trp Phe 
350 

Glu Leu Ser Arg 
365 

Val Asp Ala Asp 



Asp Asn Lys Ala 
400 

He Arg Ser Arg 
415 

Asn Val Pro Val 
430 

Leu Val Lys Val 
445 
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Arg Lys Val Gin 
450 

Arg Gly Leu Leu 
465 

Gly Leu Gly Asp 



Asp Thr Ser Ser 
500 

Thr Phe Ser Asp 
515 

Glu Arg Glu Ser 
530 

Lys Lys lie Asp 
545 

Val Glu Lys Phe 



Leu lie Gly His 
580 

Thr Val Thr Gly 
595 

Ala Leu Ser Asn 

610 

Glu Asp Met Glu 
625 

He Ser Pro Glu 



Asn Gly Ala Leu 
660 

Ala Thr Trp Ala 
675 

Cys Val Arg Lys 
690 

Leu Arg Leu Ser 

705 

Lys Lys Thr lie 



Gin Met Lys Asn 
740 

Leu Gly Asn Leu 
755 



He Glu Leu Met 
455 

Arg Arg Phe Ala 
470 

Cys Val Tyr Asp 
485 

Asp Glu Leu Lys 



Tyr Leu Glu Gly 
520 

Val Ser Glu Leu 
535 

Glu He Arg Asn 
550 

Gin Glu Phe Cys 
565 

Val He Glu Ala 



Leu Gly Thr Leu 
600 

Asn Ser Val Asp 
615 

Glu He Val Leu 
630 

Met Ser Arg Trp 
645 

Val Glu Tyr Lys 



Glu Lys Gly Lys 
680 

Pro Gin Phe Ser 
695 

Asn Met Asn Phe 
710 

Thr Pro Val Val 

725 

Tyr He Asp Tyr 



Glu Arg He Val 
760 



Ser Asp Lys Val 
460 

Asp Ser Leu Lys 
475 

Ala Leu Val Gin 
490 

Val Leu Leu Pro 

505 

Met Tyr Glu Ala 



Leu Ala Ser Gly 
540 

Asn Tyr Ser Gly 
555 

Lys Glu Leu Asn 
570 

He Phe Ser Gin 
585 

Ser Pro Glu Met 



Thr Cys Asp Asp 
620 

He Ala Asp Lys 
635 

Ala Ser Met Lys 
650 

Val Gly Thr Ser 
665 

Ala Val Leu Pro 



Lys Pro Leu Asp 
700 

Phe Lys Val Ser 
715 

Tyr Thr Gly Thr 
730 

Leu Ser Ala Ser 
745 

Arg Ser Asp Trp 



Val He Glu Ala 



Ser Ala Val Glu 
480 

Thr Gly Trp Phe 
495 

Glu Pro Phe Met 
510 

Asp Ala Lys He 
525 

Asp Asp Leu Phe 



Val Glu Phe Asp 
560 

Val Asn Pro Met 
575 

Lys Ala Gly Val 
590 

Gly Ala Ser Val 
605 

Met Asp Val Thr 



Asn Kis Ser Tyr 
640 

Tyr Gly Asn Asn 
655 

Met Thr Leu Pro 
670 

Leu Ser Gly He 
685 

Glu Glu Asp Asp 



Asp Leu Lys Leu 

720 

He Arg Glu Arg 
735 

Leu Gly Ser Thr 
750 

Asn Gly Thr Glu 
765 



Glu Ser Met Gin Thr Phe Gly Leu Tyr Asp Cys Glu Lys Cys Lys Trp 
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770 775 780 

Leu Leu Leu Pro Ser Glu Lys Lys His Ala Trp Ala Val Val Leu Ala 
785 790 795 800 

Ser Asp Asp Thr Thr Arg lie lie Phe Leu Ser Tyr Asp Glu Ser Gly 
805 810 815 

Ser Pro lie lie Asp Lys Lys Asn Trp Lys Arg Phe Ala Val Cys Ser 
820 825 830 

Asp Thr Lys Val Tyr Ser Val lie Arg Ser Leu Glu Val Leu Asn Lys 
835 840 845 

Glu Ala Thr Val Asp Pro Gly Val Tyr lie Thr Leu Val Asp Gly Val 
850 855 860 

Pro Gly Cys Gly Lys Thr Ala Glu He He Ala Arg Val Asn Trp Lys 

865 870 875 . 880 

Thr Asp Leu Val Leu Thr Pro Gly Arg Glu Ala Ala Ala Met He Arg 
885 890 895 

Arg Arg Ala Cys Ala Leu His Lys Ser Pro Val Ala Thr Ser Asp Asn 
900 905 910 

Val Arg Thr Phe Asp Ser Phe Vai Met Asn Lys Lys Val Phe Lys Phe 
915 920 925 

Asp Ala Val Tyr Val Asp Glu Gly Leu Met Val His Thr Gly Leu Leu 
930 935 940 

Asn Phe Ala Leu Lys He Ser Gly Cys Lys Lys Ala Phe Val Phe Gly 
945 950 955 960 

Asp Ala Lys Gin He Pro Phe He Asn Arg Val Met Asn Phe Asp Tyr 
965 970 975 

Pro Lys Glu Leu Arg Thr Leu He Val Asp Asn Val Glu Arg Arg Tyr 

980 985 990 

He Thr His Arg Cys Pro Arg Asp Val Thr Ser Phe Leu Asn Thr He 
995 1000 1005 

Tyr Lys Ala Ala Val Ser Thr Thr Ser Pro Val Val His Ser Val Lys 
1010 1015 1020 

Ala He Lys Val Ser Gly Ala Gly He Leu Arg Pro Glu Leu Thr Lys 
1025 1030 1035 1040 

He Lys Gly Lys He He Thr Phe Thr Gin Ser Asp Lys Gin Ser Leu 
1045 1050 1055 

He Lys Ser Gly Tyr Asn Asp Val Asn Thr Val His Glu He Gin Gly 
1060 ' 1065 1070 

Glu Thr Phe Glu Glu Thr Ala Val Val Arg Ala Thr Pro Thr Pro He 
1075 1080 1085 



Gly Leu He Ala Arg Asp Ser Pro His Val Leu Val Ala Leu Thr Arg 
1090 1095 1100 
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His Thr Lys Ala Met Val Tyr Tyr Thr Val Val Phe Asp Ala Val Thr 

1105 ' 1110 1115 1120 

Ser lie lie Ala Asp Val Glu Lys Val Asp Gin Ser lie Leu Thr Met 

1125 1130 1135 



Phe Ala Thr Thr Val Pro Thr Lys 

1140 



<210> 3 

<211> 1503 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to 57 kD protein of CGMMV 
isolate 4 



<400> 3 

atggagaatt cgctgtatgt ccaccgcaat atcttcctcc ctgttactaa gacagggttt 60 

tatacggata tgcaggagtt ctatgacagg tgtcttccag ggaattcttt tgttctgaac 120 

gatttcgatg ccgtcaccat gcggttgagg gataatgaat tcaatttgca accttgtaga 180 

ttaactttaa gtaatttaga tccggtgccg gctttgatta agagtgaggc aaaagatttt 240 

ctggttcccg tattgcgaac ggcttgcgaa aggccgcgta ttccgggtct tctcgaaaat 300 

cttgttgcta tgataaagag gaatatgaat actcctgatt tggctgggac cgtggatata 360 

actaatatgt ctatttctat agtagataat ttcttttctt cctttgtcag ggacgaggtt 420 

ctacttgatc atttagattg cgttagagct agttctattc agagtttttc cgattggttt 4 80 

tcttgtcagc caacctcggc ggttggccag ttagctaatt ttaacttcat agatttacct 540 

gcctttgata cgtatatgca tatgattaaa aggcagccta agagtcggtt agatacttcg 600 

attcagtccg aatatccggc cttacaaact attgtatatc atccgaaggt ggtaaacgca 660 

gttttcgggc cggtttttaa gtatctgact actaagtttc ttagcatggt agataattct 720 

aagtttttct tttatactag gaaaaagcca gaggatctgc aggaattttt ctcggatctt 780 

tcttcccatt ctgattatga aattcttgag ctcgatgttt ctaaatatga taagtcgcag 840 

tccgatttcc atttctctat cgagatggca atttgggaaa ggctgggact agatgatatt 900 

ttagcrrgga tgtggtctat gggtcataag agaactatac tgcaagattt ccaagctgga 950 

ataaagacgc tcatttatta tcaaaggaag tctggcgacg taactacttt cataggtaat 1020 

acttttatta ttgcagcgtg tgtagctagt atgttaccgt tagataagtg ttttaaggct 1080 

agtt:;ttgtg gtgatgattc gttaatctac cttcctaagg gtttggagta tcctgatatt 1140 

caggctactg ccaatttggt ttggaatttt gaggcgaaac ttttccggaa gaagtatggt 1200 

tacttctgcg ggaaatatat cattcatcac gccaacggtt gtattgttta ccctgaccct 1260 

ttgaagttaa ttagtaaatt aggtagtaag agtcttgtag ggtacgagca tgtcgaggag 1320 

tttcgtatat ctctcctcga tgtcgctcac agtttgttta atggtgctta tttccatttg 1380 

ctcgacgatg caatccacga gttgtttcct aacgctgggg gttgcagttt tgtaataaat 1440 

tgtttgtgta agtacttgag tgataagcgc cttttccgta gtctttatat agatgtctct 1500 
aag 1503 



<210> 4 
<211> 501 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence; amino acid 

sequence corresponding to 57 kO protein of CGMMV 
isolate 4 
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<400> 4 

Met Glu Asn Ser 

1 

Lys Thr Giy Phe 
20 

Pro Gly Asn Ser 



Leu Arg Asp Asn 
50 

Asn Leu Asp Pro 
65 

Leu Val Pro Val 



Leu Leu Glu Asn 
100 

Asp Leu Ala Gly 

115 

Asp Asn Phe Phe 
130 

Leu Asp Cys Val 
145 

Ser Cys Gin Pro 



lie Asp Leu Pro 
180 

Pro Lys Ser Arg 
195 

Gin Thr He Val 
210 

Val Phe Lys Tyr 

225 

Lys Phe Phe Phe 



Phe Ser Asp Leu 
260 

Val Ser Lys Tyr 
275 

Met Ala He Trp 
290 

Trp Ser Met Gly 
305 



Leu Tyr Val His 

5 

Tyr Thr Asp Met 



Phe Val Leu Asn 
40 

Glu Phe Asn Leu 
55 

Val Pro Ala Leu 
70 

Leu Arg Thr Ala 
85 

Leu Val Ala Met 



Thr Val Asp He 

120 

Ser Ser Phe Val 
135 

Arg Ala Ser Ser 
150 

Thr Ser Ala Val 
165 

Ala Phe Asp Thr 



Leu Asp Thr Ser 
200 

Tyr His Pro Lys 
215 

Leu Thr Thr Lys 

230 

Tyr Thr Arg Lys 
245 

Ser Ser His Ser 



Asp Lys Ser Gin 
280 

Glu Arg Leu Gly 
295 

His Lys Arg Thr 
310 



Arg Asn He Phe 
10 

Gin Glu Phe Tyr 
25 

Asp Phe Asp Ala 



Gin Pro Cys Arg 
60 

He Lys Ser Glu 
75 

Cys Glu Arg Pro 
90 

He Lys Arg Asn 
105 

Thr Asn Met Ser 



Arg Asp Glu Val 
140 

He Gin Ser Phe 
155 

Gly Gin Leu Ala 
170 

Tyr Met His Met 
185 

He Gin Ser Glu 



Val Val Asn Ala 
220 

Phe Leu Ser Met 
235 

Lys Pro Glu Asp 
250 

Asp Tyr Glu He 
265 

Ser Asp Phe His 



Leu Asp Asp He 
300 

He Leu Gin Asp 
315 



Leu Pro Val Thr 

15 

Asp Arg Cys Leu 

30 

Val Thr Met Arg 
45 

Leu Thr Leu Ser 



Ala Lys Asp Phe 
80 

Arg He Pro Gly 
95 

Met Asn Thr Pro 
110 

He Ser He Val 
125 

Leu Leu Asp His 



Ser Asp Trp Phe 
160 

Asn Phe Asn Phe 
175 

He Lys Arg Gin 
190 

Tyr Pro Ala Leu 
205 

Val Phe Gly Pro 



Val Asp Asn Ser 

240 

Leu Gin Glu Phe 
255 

Leu Glu Leu Asp 
270 

Phe Ser He Glu 
285 

Leu Ala Trp Met 



Phe Gin Ala Gly 
320 
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lie Lys Thr Leu lie Tyr Tyr Gin Arg Lys Ser Gly Asp Val Thr Thr 
, 325 330 335 

Phe He Gly Asn Thr Phe He He Ala Ala Cys Val Ala Ser Met Leu 
340 345 350 

Pro Leu Asp Lys Cys Phe Lys Ala Ser Phe Cys Gly Asp Asp Ser Leu 
355 360 365 

He Tyr Leu Pro Lys Gly Leu Glu Tyr Pro Asp He Gin Ala Thr Ala 
370 375 380 

Asn Leu Val Trp Asn Phe Glu Ala Lys Leu Phe Arg Lys Lys Tyr Gly 

385 390 395 400 

Tyr Phe Cys Gly Lys Tyr He He His His Ala Asn Gly Cys He Val 
405 410 415 

Tyr Pro Asp Pro Leu Lys Leu He Ser Lys Leu Gly Ser Lys Ser Leu 
420 425 430 

Val Gly Tyr Glu His Val Glu Glu Phe Arg He Ser Leu Leu Asp Val 
435 440 445 

Ala His Ser Leu Phe Asn Gly Ala Tyr Phe His Leu Leu Asp Asp Ala 
450 455 460 

He His Glu Leu Phe Pro Asn Ala Gly Gly Cys Ser Phe Val He Asn 
465 470 475 480 

Cys Leu Cys Lys Tyr Leu Ser Asp Lys Arg Leu Phe Arg Ser Leu Tyr 
485 ' 490 495 

He Asp Val Ser Lys 
500 



<210> 5 
<211> 4935 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to 186 kD protein of CGMMV 
isolate 4 



<400> 5 

atggcaaaca 
aatctcgtta 
catcaagata 
cttgtcaccg 
cattcccttg 
tatggttcac 
tcatatgtgc 
tacaacgaca 
cttccgactt 
tgtccagacg 
gtttcattac 
aaaaacgtca 



ttaatgaaca 
gtcagcragc 
gacgcccaaa 
atgcgtatcc 
cgggaggtitt 
cttgctr tga 
attgctgcaa 
tgatcacaca 
tccagaraga 
tttttcaaga 
attcgartta 
gggtcrgtta 



aatcaacaat 
atcaaagagg 
aatgaacttt 
ggagttttcg 
gaggcttctt 
tattggcggt 
tccgtgcctg 
acatgtacag 
tgctttcagg 
atgctcctat 
tgatatccct 
cgcagccttt 



caacgtgatg 
gtgtatgacg 
tctcgtgtgg 
attagtttca 
gaattggaat 
aattacacgc 
gatcttaagg 
aggcacaaag 
aggtatgaag 
gattttggga 
tattcttcga 
catttcrcgg 



ctgctgctag 
aggccgttcg 
tcagtacaga 
ccgctaccaa 
acatgatgat 
agcatttatt 
atgttgcgag 
gatctggtgg 
attcgcccgt 
gtggtaggga 
ttgggccagc 
aggcgttgct 



cgggagaaat 
ctcgttagat 
gcacaccagg 
gaattcagtt 
gcaggtgcct 
taaaggtaga 
gaatgtgatg 
gtgtagacct 
cgcagtcacc 
taatcatgcg 
tcttcatagg 
cctaggttcg 



60 

120 

180 

240 

300 

350 

420 

480 

540 

600 

660 

720 
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cccgtgggta 
tttcttttta 
attgtaatgc 
gttaagcgtg 
aaatctgtag 
attttccaag 
atacctaagt 
gtcgatgctg 
ttagtgtgga 
ggagtttccg 
tcgatattcc 
gtgatcgagg 
ggactaggtg 
gacgaactga 
atgtacgagg 
gatgatctgt 
gtggagaaat 
gtgatcgaag 
tctcctgaga 
atggacgtaa 
atttctccag 
gttgagtaca 
gctgttttac 
gaggaggacg 
aagaagacta 
tatatcgatt 
aggagtgact 
aagtgcaagt 
agtgatgata 
gacaagaaaa 
cgtagtttag 
gtcgatgggg 
actgaccttg 
gccctacaca 
atgaataaga 
acggggttgc 
gatgctaagc 
agaactttga 
gtcactagtt 
cattccgtga 
atcaaaggga 
tacaatgatg 
gtgcgtgcaa 
gctttaacgc 
agcataatag 
gtgcctacca 
aagacagggt 
tttgttctga 
caaccttgta 
gcaaaagatt 
cttctcgaaa 
accgtggata 
agggacgagg 
tccgattggt 
atagatttac 
ttagatactt 
gtggtaaacg 
gtagataatt 
ttctcggatc 
gataagtcgc 
ctagatgata 



atttaaatag 
gtgaggagtc 
gtacttattt 
tagacacttt 
ggcactattc 
ataaggccac 
ttgaactctc 
attttgtcca 
agaatgtcca 
taaaatctga 
ttctcgtgaa 
cgaggggttt 
attgcgtcta 
aagtattact 
cagatgcaaa 
tcaagaagat 
ttcaagaatt 
ctattttttc 
tgggtgcttc 
cr gaggatat 
aaatgtcgag 
aggtcggaac 
cgttgtcggg 
acttgaggtt 
tcactccagt 
arctatcggc 
ggaatggtac 
ggttactgtt 
ccactcgtat 
attggaagcg 
aagtcttaaa 
ttccgggctg 
tgttgactcc 
agtcacctgt 
aggtttttaa 
tcaactttgc 
aaattccgtt 
tagttgataa 
ttcttaatac 
aggcaataaa 
agatcataac 
tgaatactgt 
caccgactcc 
ggcacaccaa 
cggatgtgga 
aaatggagaa 
trtatacgga 
acgatttcga 
gattaacttt 
ttctggttcc 
atcttgttgc 
taactaatat 
ttctacttga 
tttcttgtca 
ctgcctttga 
cgattcagtc 
cagttttcgg 
ctaagttttt 
tttcttccca 
agtccgattt 
ttttagcttg 



tataggggct 
aactttgcat 
ccctgctgat 
tttttttagg 
gaagtcgaaa 
gttttctgtg 
gagatttctt 
taccattatt 
gtcttttgta 
atggaatgta 
ggttagaaag 
gcttcggagg 
tgatgctcta 
acctgaaccg 
aattgagaga 
tgacgaaata 
ctgtaaagaa 
acagaaggca 
cgttgcgtta 
ggaggaaata 
atgggctagt 
ctcgatgact 
aatctgtgca 
atcaaacatg 
tgrctatact 
ttctctgggt 
cgaggagagc 
gccatcggag 
aatctttctg 
gttcgctgtc 
taaggacgcc 
tggaaaaacc 
cggaagggaa 
agctactagt 
atttgacgcc 
gttgaagatt 
tatcaataga 
tgtagagcgt 
tatttataaa 
ggtttctggg 
gtttactcag 
gcatgagatt 
aataggtctg 
ggcaatggtg 
aaaggtcgat 
ttcgctgtat 
tatgcaggag 
tgccgtcacc 
aagtaattta 
cgtattgcga 
tatgataaag 
gtctatttct 
tcatttagat 
gccaacctcg 
tacgtatatg 
cgaatatccg 
gccggttttt 
cttttatact 
ttctgattat 
ccatttctct 
gatgtggtct 



caatttaggg 
tacactcata 
gataggttcg 
ttagttaggg 
tctgagtart 
tggtttcccg 
tctggaaarg 
aatcacat ra 
gaatctatac 
ccggtcgatc 
gtgcagarrg 
ttcgctgata 
gttcaaaccg 
tttatgacct 
gagagtgtct 
aggaataatt 
ctgaatgtra 
ggggtaacag 
tccaataatt 
ctgt:;ga- ag 
atgaaatacg' 
ttacctgcca 
agaaagcccc 
aatttcttra 
gggaccarzc 
tctacgcr~g 
atgcaaacrt 
aagaaacacg 
tcgtatgacg 
tgttctgata 
acagtcga~c 
gctgaaarta 
gcggctgcTia 
gataacgtta 
gtctacgrag 
tcgggttcra 
gttatgaart 
aggtatatra 
gctgcggtrt 
gctggtartc 
tctgaraaac 
cagggggaga 
attgcccgag 
tattataccg 
cagtcgatrt 
gtccaccgca 
ttctatgaca 
atgcggtr ga 
gatccggrgc 
acggcttgcg 
aggaatatga 
atagtagara 
tgcgttagag 
gcggttggcc 
catatgatra 
gccttacaaa 
aagtatcrga 
aggaaaaagc 
gaaattctrg 
atcgagatgg 
atgggtca-a 



ttgatggtga 
gtttggagaa 
tgtatattaa 
cagacacaca 
ttgcgttgaa 
aagcgaagcg 
tgaaagtctc 
gcacgtacga 
gctctagggt 
agcttactga 
agttaatgtc 
gtctcaaatc 
gttggtttga 
tttcagatta 
ctgagctgct 
acagcggagt 
atcctatgct 
tcacgggcct 
ctgtagatac 
cagacaagaa 
gcaataataa 
cctgggcaga 
aattttcaaa 
aggtgagtga 
gagagaggca 
gtaatcttga 
ttggattgta 
cctgggctgt 
aatccggttc 
ccaaagttta 
ctggggtgta 
tagcgagggt 
tgatcaggcg 
ggacttttga 
atgaaggtct 
aaaaggcctt 
ttgattatcc 
cccataggtg 
ctaccactag 
tgaggcccga 
aatccttgat 
cctttgagga 
attcaccaca 
ttgtgttcga 
tgactatgtt 
atatcttcct 
ggtgtcttcc 
gggataatga 
cggctttgat 
aaaggccgcg 
atactcctga 
atttcttttc 
ctagttctat 
agttagctaa 
aaaggcagcc 
ctattgtata 
ctactaagtt 
cagaggatct 
agctcgatgt 
caatttggga 
agagaactat 



cgatgtgcat 
tattaagttg 
ggagtttatg 
tatgctccat 
cacccctccg 
gaaggtgttg 
taggatgctt 
taacaaggcc 
aattgtaaac 
tatctcattc 
tgataaggtt 
cgccgtagaa 
cacctctagc 
tctcgaaggg 
tgcttccgga 
tgaatttgat 
aatcggtcat 
aggcacgctc 
atgtgatgat 
tcactcttat 
cggggcctta 
aaagggtaag 
gccactcgat 
tctgaagttg 
gatgaagaat 
gagaattgtt 
cgattgcgag 
agtcctggcg 
tcctataatt 
tagtgtaatt 
tataacttta 
caattggaaa 
aagagcctgt 
ttctttcgta 
tatggtccac 
tgtcttcggt 
taaggaatta 
tcctagagat 
tccggttgta 
gttgacgaag 
caagagtggg 
gacggcggtt 
cgtgttagtg 
tgccgtaaca 
tgctactact 
ccctgttact 
agggaattct 
attcaatttg 
taagagtgag 
tattccgggt 
tttggctggg 
ttcctttgtc 
tcagagtttt 
ttttaacttc 
taagagtcgg 
tcatccgaag 
tcttagcatg 
gcaggaattt 
ttctaaatat 
aaggctggga 
actgcaagat 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3350 

3420 

3480 

3540 

3600 

3560 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 
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ttccaagctg 
ttcataggta 
tgttttaagg 
tatcctgata 
aagaagtatg 
taccctgacc 
catgtcgagg 
tatttccatt 
tttgtaataa 
atagatgtct 



gaataaagac 
atacttttat 
ctagtttttg 
ttcaggctac 
gttacttctg 
ctttgaagtt 
agtttcgtat 
tgctcgacga 
attgtttgtg 
ctaag 



gctcatttat 
tattgcagcg 
tggtgatgat 
tgccaatttg 
cgggaaatat 
aattagtaaa 
atctctcctc 
tgcaatccac 
taagtacttg 



tatcaaagga 
tgtgtagcta 
tcgttaatct 
grttggaatt 
atcattcatc 
ttaggtagta 
gatgtcgctc 
gagttgtttc 
agtgataagc 



agtctggcga 
gtatgttacc 
accttcctaa 
ttgaggcgaa 
acgccaacgg 
agagtcttgt 
acagtttgtt 
ctaacgctgg 
gccttttccg 



cgtaactact 
gttagataag 
gggtttggag 
acttttccgg 
ttgtattgtt 
agggtacgag 
taatggtgct 
gggttgcagt 
tagtctttat 



4 4 40 
4500 
4560 
4620 
4 680 
4740 
4800 
4860 
4920 
4935 



<210> 6 
<211> 1645 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: amino acid 

sequence corresponding to 18 6 kD protein of CGMMV 
isolate 4 

<400> 6 

Met Ala Asn lie Asn Glu Gin lie Asn Asn Gin Arg Asp Ala Ala Ala 
15 10 15 

Ser Gly Arg Asn Asn Leu Val Ser Gin Leu Ala Ser Lys Arg Val Tyr 
20 25 30 

Asp Glu Ala Val Arg Ser Leu Asp His Gin Asp Arg Arg Pro Lys Met 
35 40 45 

Asn Phe Ser Arg Val Val Ser Thr Glu His Thr Arg Leu Val Thr Asp 
50 55 60 

Ala Tyr Pro Glu Phe Ser He Ser Phe Thr Ala Thr Lys Asn Ser Val 
65 70 75 80 

His Ser Leu Ala Gly Gly Leu Arg Leu Leu Glu Leu Glu Tyr Met Met 
85 90 95 

Met Gin Val Pro Tyr Gly Ser Pro Cys Phe Asp He Gly Gly Asn Tyr 

100 105 110 

Thr Gin His Leu Phe Lys Gly Arg Ser Tyr Val His Cys Cys Asn Pro 
115 120 125 

Cys Leu Asp Leu Lys Asp Val Ala Arg Asn Val Met Tyr Asn Asp Met 
130 135 140 

He Thr Gin His Val Gin Arg His Lys Gly Ser Gly Gly Cys Arg Pro 
145 150 155 160 

Leu Pro Thr Phe Gin He Asp Ala Phe Arg Arg Tyr Glu Asp Ser Pro 
165 170 175 

Val Ala Val Thr Cys Pro Asp Val Phe Gin Glu Cys Ser Tyr Asp Phe 
180 185 190 



Gly Ser Gly Arg Asp Asn His Ala Val Ser Leu His Ser He Tyr Asp 
195 200 205 
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lie Pro Tyr Ser 

210 

Val Cys Tyr Ala 
225 

Pro Val Gly Asn 



Asp Asp Val His 
260 

His Ser Leu Glu 
275 

Ala Asp Asp Arg 
290 

Asp Thr Phe Phe 
305 

Lys Ser Val Gly 



Asn Thr Fro Pro 
340 

Pro Glu Ala Lys 
355 

Phe Leu Ser Gly 
370 

Phe Val His Thr 
385 

Leu Val Trp Lys 



Val He Val Asn 
420 

Asp Gin Leu Thr 
435 

Arg Lys Val Gin 
450 

Arg Gly Leu Leu 
465 

Gly Leu Gly Asp 



Asp Thr Ser Ser 
500 

Thr Phe Ser Asp 

515 



Ser He Gly Pro 

215 

Ala Phe His Phe 
230 

Leu Asn Ser He 
245 

Phe Leu Phe Ser 



Asn He Lys Leu 
280 

Phe Val Tyr He 
295 

Phe Arg Leu Val 
310 

His Tyr Ser Lys 

325 

He Phe Gin Asp 



Arg Lys Val Leu 
360 

Asn Val Lys Val 
375 

He He Asn His 
390 

Asn Val Gin Ser 
405 

Gly Val Ser Val 



Asp He Ser Phe 
440 

He Glu Leu Met 
455 

Arg Arg Phe Ala 
470 

Cys Val Tyr Asp 
485 

Asp Glu Leu Lys 



Tyr Leu Glu Gly 
520 



Ala Leu His Arg 

220 

Ser Glu Ala Leu 
235 

Gly Ala Gin Phe 
250 

Glu Glu Ser Thr 
265 

He Val Met Arg 



Lys Glu Phe Met 
300 

Arg Ala Asp Thr 
315 

Ser Lys Ser Glu 
330 

Lys Ala Thr Phe 
345 

He Pro Lys Phe 



Ser Arg Met Leu 
380 

He Ser Thr Tyr 
395 

Phe Val Glu Ser 
410 

Lys Ser Glu Trp 
425 

Ser He Phe Leu 



Ser Asp Lys Val 
460 

Asp Ser Leu Lys 
475 

Ala Leu Val Gin 
490 

Val Leu Leu Pro 
505 

Met Tyr Glu Ala 



Lys Asn Val Arg 



Leu Leu Gly Ser 
240 

Arg Val Asp Gly 
255 

Leu His Tyr Thr 
270 

Thr Tyr Phe Pro 
285 

Val Lys Arg Val 



His Met Leu His 
320 

Tyr Phe Ala Leu 

335 

Ser Val Trp Phe 
350 

Glu Leu Ser Arg 
365 

Val Asp Ala Asp 



Asp Asn Lys Ala 
400 

He Arg Ser Arg 
415 

Asn Val Pro Val 
430 

Leu Val Lys Val 
445 

Val He Glu Ala 



Ser Ala Val Glu 
480 

Thr Gly Trp Phe 
495 

Glu Pro Phe Met 
510 

Asp Ala Lys He 
525 
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Glu Arg Glu Ser 
530 

Lys Lys lie Asp 
545 

Val Glu Lys Phe 



Leu lie Gly His 

580 

Thr Val Thr Gly 
595 

Ala Leu Ser Asn 
610 

Glu Asp Met Glu 
625 

lie Ser Pro Glu 



Asn Gly Ala Leu 
660 

Ala Thr Trp Ala 
675 

Cys Val Arg Lys 
690 

Leu Arg Leu Ser 
705 

Lys Lys Thr lie 



Gin Met Lys Asn 
740 

Leu Gly Asn Leu 
755 

Glu Ser Met Gin 
770 

Leu Leu Leu Pro 

785 

Ser Asp Asp Thr 



Ser Pro lie lie 
820 

Asp Thr Lys Val 
835 

Glu Ala Thr Val 



Val Ser Glu Leu 
535 

Glu lie Arg Asn 
550 

Gin Glu Phe Cys 

565 

Val He Glu Ala 



Leu Gly Thr Leu 
600 

Asn Ser Val Asp 
615 

Glu He Val Leu 
630 

Met Ser Arg Trp 
645 

Val Glu Tyr Lys 



Glu Lys Gly Lys 
680 

Pro Gin Phe Ser 

695 

Asn Met Asn Phe 
710 

Thr Pro Val Val 
725 

Tyr He Asp Tyr 



Glu Arg He Val 
760 

Thr Phe Gly Leu 
775 

Ser Glu Lys Lys 
790 

Thr Arg He He 

805 

Asp Lys Lys Asn 



Tyr Ser Val He 
840 

Asp Pro Gly Val 



12 



Leu Ala Ser Gly 
540 

Asn Tyr Ser Gly 
555 

Lys Glu Leu Asn 
570 

He Phe Ser Gin 

585 

Ser Pro Glu Met 



Thr Cys Asp Asp 
620 

He Ala Asp Lys 
635 

Ala Ser Met Lys 
650 

Val Gly Thr Ser 
665 

Ala Val Leu Pro 



Lys Pro Leu Asp 

700 

Phe Lys Val Ser 
715 

Tyr Thr Gly Thr 
730 

Leu Ser Ala Ser 
745 

Arg Ser Asp Trp 



Tyr Asp Cys Glu 
780 

His Ala Trp Ala 
795 

Phe Leu Ser Tyr 

810 

Trp Lys Arg Phe 
825 

Arg Ser Leu Glu 



Tyr He Thr Leu 



Asp Asp Leu Phe 



Val Glu Phe Asp 
560 

Val Asn Pro Met 
575 

Lys Ala Gly Val 

590 

Gly Ala Ser Val 
605 

Met Asp Val Thr 



Asn His Ser Tyr 
640 

Tyr Gly Asn Asn 
655 

Met Thr Leu Pro 
670 

Leu Ser Gly He 
685 

Glu Glu Asp Asp 



Asp Leu Lys Leu 
720 

He Arg Glu Arg 
735 

Leu Gly Ser Thr 
750 

Asn Gly Thr Glu 
765 

Lys Cys Lys Trp 



Val Val Leu Ala 
800 

Asp Glu Ser Gly 

815 

Ala Val Cys Ser 
830 

Val Leu Asn Lys 
845 

Val Asp Gly Val 
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850 



855 



860 



Pro Gly Cys Gly Lys Thr Ala Glu lie lie Ala Arg Val Asn Trp Lys 
865 870 875 880 

Thr Asp Leu Val Leu Thr Pro Gly Arg Glu Ala Ala Ala Met lie Arg 
885 890 895 

Arg Arg Ala Cys Ala Leu His Lys Ser Pro Val Ala Thr Ser Asp Asn 
900 905 910 

Val Arg Thr Phe Asp Ser Phe Val Met Asn Lys Lys Val Phe Lys Phe 
915 920 925 

Asp Ala Val Tyr Val Asp Glu Gly Leu Met Val His Thr Gly Leu Leu 
930 935 940 

Asn Phe Ala Leu Lys lie Ser Gly Cys Lys Lys Ala Phe Val Phe Gly 
945 950 955 . 960 

Asp Ala Lys Gin lie Pro Phe lie Asn Arg Val Met Asn Phe Asp Tyr 
965 970 975 

Pro Lys Glu Leu Arg Thr Leu lie Val Asp Asn Val Glu Arg Arg Tyr 
980 985 990 

lie Thr His Arg Cys Pro Arg Asp Val Thr Ser Phe Leu Asn Thr lie 
995 1000 1005 

Tyr Lys Ala Ala Val Ser Thr Thr Ser Pro Val Val His Ser Val Lys 
1010 1015 1020 

Ala lie Lys Val Ser Gly Ala Gly lie Leu Arg Pro Glu Leu Thr Lys 
1025 1030 1035 1040 

lie Lys Gly Lys lie lie Thr Phe Thr Gin Ser Asp Lys Gin Ser Leu 
1045 1050 1055 

lie Lys Ser Gly Tyr Asn Asp Val Asn Thr Val His Glu lie Gin Gly 
1060 1065 1070 

Glu Thr Phe Glu Glu Thr Ala Val Val Arg Ala Thr Pro Thr Pro lie 
1075 1080 1085 

Gly Leu lie Ala Arg Asp Ser Pro His Val Leu Val Ala Leu Thr Arg 
1090 1095 1100 

His Thr Lys Ala Met Val Tyr Tyr Thr Val Val Phe Asp Ala Val Thr 
1105 1110 1115 1120 

Ser lie lie Ala Asp Val Glu Lys Val Asp Gin Ser lie Leu Thr Met 
1125 1130 1135 

Phe Ala Thr Thr Val Pro Thr Lys Met Glu Asn Ser Leu Tyr Val His 
1140 1145 1150 

Arg Asn lie Phe Leu Pro Val Thr Lys Thr Gly Phe Tyr Thr Asp Met 
1155 1160 1165 



Gin Glu Phe Tyr Asp Arg Cys Leu Pro Gly Asn Ser Phe Val Leu Asn 
1170 1175 " 1180 
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Asp Phe Asp Ala Val Thr Met Arg Leu Arg Asp Asn Glu Phe Asn Leu 
ll'ss 1190 1195 1200 

Gin Pro Cys Arg Leu Thr Leu Ser Asn Leu Asp Pro Val Pro Ala Leu 
1205 1210 1215 

lie Lys Ser Glu Ala Lys Asp Phe Leu Val Pro Val Leu Arg Thr Ala 

1220 1225 1230 

Cys Glu Arg Pro Arg lie Pro Gly Leu Leu Glu Asn Leu Val Ala Met 
1235 1240 1245 

lie Lys Arg Asn Met Asn Thr Pro Asp Leu Ala Gly Thr Val Asp lie 
1250 1255 1260 

Thr Asn Met Ser lie Ser lie Val Asp Asn Phe Phe Ser Ser Phe Val 
1265 1270 1275 1280 

Arg Asp Glu Val Leu Leu Asp His Leu Asp Cys Val Arg Ala Ser Ser 
1285 1290 1295 

He Gin Ser Phe Ser Asp Trp Phe Ser Cys Gin Pro Thr Ser Ala Val 
1300 1305 1310 



Gly Gin Leu Ala Asn Phe Asn Phe 

1315 1320 

Tyr Met His Met He Lys Arg Gin 

1330 1335 

He Gin Ser Glu Tyr Pro Ala Leu 

1345 1350 



lie Asp Leu Pro Ala Phe Asp Thr 
1325 

Pro Lys Ser Arg Leu Asp Thr Ser 
1340 

Gin Thr He Val Tyr His Pro Lys 
1355 1360 



Val Val Asn Ala Val Phe Gly Pro Val Phe Lys Tyr Leu Thr Thr Lys 
1365 1370 1375 

Phe Leu Ser Met Val Asp Asn Ser Lys Phe Phe Phe Tyr Thr Arg Lys 
1380 1385 1390 

Lys Pro Glu Asp Leu Gin Glu Phe Phe Ser Asp Leu Ser Ser His Ser 
1395 1400 1405 

Asp Tyr Glu He Leu Glu Leu Asp Val Ser Lys Tyr Asp Lys Ser Gin 
1410 1415 1420 

Ser Asp Phe His Phe Ser He Glu Met Ala He Trp Glu Arg Leu Gly 
1425 1430 1435 1440 

Leu Asp Asp He Leu Ala Trp Met Trp Ser Met Gly His Lys Arg Thr 
1445 1450 1455 

He Leu Gin Asp Phe Gin Ala Gly He Lys Thr Leu He Tyr Tyr Gin 
1460 1465 1470 

Arg Lys Ser Gly Asp Val Thr Thr Phe He Gly Asn Thr Phe He He 
1475 1480 1485 



Ala Ala Cys Val Ala Ser Met Leu Pro Leu Asp Lys Cys Phe Lys Ala 
1490 1495 1500 
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Ser Phe Cys Gly Asp Asp Ser Leu lie Tyr Leu Pro Lys Gly Leu Glu 
1505 ' 1510 1515 1520 

Tyr Pro Asp lie Gin Ala Thr Ala Asn Leu Val Trp Asn Phe Glu Ala 
1525 1530 1535 

Lys Leu Phe Arg Lys Lys Tyr Gly Tyr Phe Cys Gly Lys Tyr lie lie 
1540 1545 1550 

His His Ala Asn Gly Cys lie Val Tyr Pro Asp Pro Leu Lys Leu lie 

1555 1560 1565 

Ser Lys Leu Gly Ser Lys Ser Leu Val Gly Tyr Glu His Val Glu Glu 
1570 1575 1580 

Phe Arg lie Ser Leu Leu Asp Val Ala His Ser Leu Phe Asn Gly Ala 
1585 1590 1595 1600 

Tyr Phe His Leu Leu Asp Asp Ala lie His Glu Leu Phe Pro Asn Ala 
1605 1610 1615 

Gly Gly Cys Ser Phe Val lie Asn Cys Leu Cys Lys Tyr Leu Ser Asp 
1620 1625 1630 



Lys Arg Leu Phe Arg Ser Leu Tyr lie Asp Val Ser Lys 
1635 1640 1645 



<210> 7 
<211> 1139 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 1 



<400> 7 

aattcggctz 

ttgcagaagg 

ggttcatccc 

ttgtgaggct 

cctgtctagt 

agtccgttgt 

ctttcttcga 

ctaggtcgca 

cggactctgc 

acgcctagca 

tttctagtgg 

gagtctttgt 

ggtttttacg 

tctacggata 

gaatcgctta 

gataatttaa 

gccgcgtttt 

tctgacggtg 

tgctcattgg 



ctgtaggggt 
aactatatgc 
taactattct 
ctctaacgta 
tgccactact 
ctcttccgat 
taaagtccct 
gtctgtcgtt 
ttctgaagag 
aacttattgc 
cgtcgcaagg 
ctgcgttacc 
ccttcctcaa 
cgcgtaatag 
acgcagttaa 
tagaatcaat 
cggtagtctg 
gtgcacacca 
tttgcgaaaa 



ggtgctactg 
aaattttcag 
gtcgtggttg 
ggtattaagg 
aactctatta 
cagtcgattg 
atttcagcgg 
ggtcgtggta 
tccagttctg 
gtttagtgct 
tactgctttc 
ttcatccgtt 
cggtcctgtg 
ggtcattgag 
gcgtactgac 
ctctaagggg 
gtcagaggct 
tagtgcatag 
cctctcgcgt 



ttgctttggt 
ctcccgccac 
cggatgccct 
atggttttca 
ttaaaaaggg 
ttctagattc 
ctgtaatggc 
agcggcattc 
tttctttcga 
tcttatgctc 
caaacccagg 
ctagatatta 
ttgaggccta 
gttgtagatc 
gatgcgtcta 
tttgatgttt 
accacctcca 
tgctttcccg 
gtgacgttga 



tgacacaagg 
cgtccgcgag 
tcgcgatcct 

tccattaact 
gcttagagct 
tttatctgag 
gagagacccc 
taaacctcca 
agatggctta 
ccgttagaac 
caggaagaga 
attctaggtt 
tcttcgtttc 
ctagcaatcc 
cagccgctag 
atgatagggc 
aggcttagcc 
ttcactttaa 
agtttctatg 



atgcattctg 
ttctctgtta 
tggtctttat 
ttagaggtcg 
tctgtagttg 
aaagttgagc 
agttataggt 
aatcggaggt 
caatccgatc 
tttacttaat 
ttccttccgt 
cccgagtgcg 
gcttcttagc 
gacgactgct 
ggctgagata 
ttcttttgaa 
ttgagggtct 
tcgaacggtt 
ggcaagccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1139 
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<210> 8 
<211> 1139 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 2 



<400> 8 

aattcggctt 

ttgcagaagg 

ggttcatccc 

ttgtgaggct 

cctgtctagt 

agtccgttgt 

ctttcttcga 

ctaggtcgca 

cggactctgc 

acgcctagca 

tttctagtgg 

gagtctttgt 

ggtttttacg 

tctacggata 

gaatcgctta 

gataatttaa 

gccgcgtttt 

tctgacggtg 

tgctcattgg 



ctgtaggggt 

aactatatgc 
taactattct 
ctctaacgta 
tgccactact 
ctcttccgat 
taaagtccct 
gtctgtcgtt 
ttctgaagag 
aacttattgc 
cgtcgcaagg 
ctgcgttacc 
ctttcctcaa 
cgcgtaatag 
acgcagttaa 
tagaatcaat 
cggtagtctg 
gtgcacacca 
tttgcgaaaa 



ggtgctactg 

aaattttcag 
gtcgtggttg 
ggtattaagg 
aactctatta 
cagtcgattg 
atttcagcgg 
ggtcgtggta 
tccagttctg 
gtttagtgct 
tactgctttc 
ttcatccgct 
cggtcctgtg 
ggtcattgag 
gcgtactgac 
ctctaagggg 
gtcagaggct 
tagtgcatag 
cctctcgcgt 



ttgctttggt 
ctcccgccac 
cggatgccct 
atggttttca 
ttaaaaaggg 
ttctagattc 
ctgtaatggc 
agcggcattc 
tttctttcga 
tcttargctc 
caaacccagg 
gtagaratta 
ttgaggccta 
gttgtagatc 
gatgcgtcta 
tttgatgttt 
accacctcca 
tgctttcccg 
gtgacgttga 



tgacacaagg 
cgtccgcgag 
tcgcgatcct 
tccattaact 
gcttagagct 
tttatctgag 
gagagacccc 
taaacctcca 
agatggctta 
ccgttagaac 
caggaagaga 
attctaggtt 
tcttcgtttc 
ctagcaatcc 
cagccgctag 
atgatagggc 
aggcttagcc 
ttcactttaa 
agtttctatg 



atgcattctg 

ttctctgtta 
tggtctttat 
ttagaggtcg 
tctgtagttg 
aaagttgagc 
agttataggt 
aatc.ggaggt 
caatccgatc 
tttacttaat 
ttccttccgt 
cccgagtgcg 
gcttcttagc 
gacgactgct 
ggctgagata 
ttcttttgaa 
ttgagggtct 
tcgaacggtt 
ggcaagccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

500 

660 

720 

780 

840 

900 

960 

1020 

1080 

1139 



<210> 9 
<211> 1139 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 3 



<400> 9 

aattcggctt 

ttgcagaagg 

ggttcatccc 

ttgtgaggct 

cctgtctagt 

agtccgttgt 

ctttcttcga 

ctaggtcgca 

cgggctctgc 

acgcctagca 

tttctagtgg 

gagtctttgt 

ggtttttacg 

tctacggata 

gagtcgctta 

gataatttaa 

gccgcgtttt 

tctgacggtg 



ctgtaggggt 
aactatatgc 
taactattct 

ctctaacgta 
tgccactact 
ctcttccgat 
taaagtccct 
gtctgtcgtt 
ttctgaagag 
aacttattgc 
cgtcgcaagg 
ctgcgttacc 
ctttcctcaa 
cgcgtaatag 
acgcagttaa 
tagaatcaat 
cggtagtctg 
gtgcacacca 



ggtgctactg 
aaattttcag 
gtcgtggttg 
ggtattaagg 
aactctatta 
cagtcgattg 
atttcagcgg 
ggtcgtggta 
tccagttctg 
gtttagtgct 
tactgctttc 
ttcatccgtc 
cggtcctgtg 
ggtcattgag 
gcgtactgac 
ctctaaaggg 
gtcagaggct 
tagtgcatag 



ttgctttggt 
ctcccgccac 
cggatgccct 

atggttttca 
ttaaaaaggg 
ttctagattc 
ctgtaatggc 
agcggtattc 
tttctttcga 
tcttatgttc 
caaacccagg 
gtagatatta 
ttgaggccta 
gttgtagatc 
gatgcgtcta 
tttgatgttt 
accacctcca 
tgctttcccg 



tgacacaagg 
cgtccgcgag 
tcgcgatcct 

tccattaact 
gcttagagct 
tttatctgag 
gagagacccc 
taaacctcca 
agatggctta 
ccgttagaac 
caggaagaga 
attctaggtt 
tcttcgtttc 
ctagcaatcc 
cagccgctag 
atgatagggc 
aggcttagcc 
ttcactttaa 



atgcattctg 
ttctctgtta 
tggtctttat 
ttagaggtcg 
tctgtagttg 
aaagttgagc 
agttataggt 
aatcggaggt 
caatccgatc 
tttacttaat 
ttccttccgt 
cccgagtgcg 
gtttcttagc 
gacgactgct 
ggctgagata 
ttcttttgaa 
ttgagggtct 
tcgaacggtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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tgctcattgg tttgcgaaaa cctctcgcgt gtgacgttga agtttctatg ggcaagccg 1139 



<210> 10 

<211> 1139 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 4 



<400> 10 

aattcggctt 

ttgcagaagg 

ggttcatccc 

ttgtgaggct 

cctgtttagt 

agtccgttgt 

ctttcttcga 

ctaggtcgca 

cggactctgc 

acgcctagca 

tttctggtgg 

gagtctttgt 

ggtttttacg 

tctacggata 

gagtcgctta 

gataatttaa 

gccgcgtttt 

tctgacggtg 

tgctcattgg 



ctgtaggggt 
aactatatgc 
taactattct 
ctctaacgta 
tgccactact 
ctcttccgat 
taaagtccct 
gtctgtcgtt 
ttctgaagag 
aacttattgc 
cgtcgcaagg 
ctgcgttacc 
ctttcctcaa 
cgcgtaatag 
acgcagttaa 
tagaatcaat 
cggtagtctg 
gtgcacacca 
tttgcgaaaa 



ggtgctactg 
aaattttcag 
gtcgtggttg 
ggtattaagg 
aactctatta 
cagtcgattg 
atttcagcag 
ggtcgtggta 
tccggttctg 
gtttagtgct 
tactgctttc 
ttcatccgtc 
cggtcctgtg 
ggtcattgag 
gcgtactgac 
ctctaagggg 
gtcagaggct 
tagtgcatag 
cctctcgcgt 



ttgctttggt 
ctcccgccac 
cggatgccct 
atggttttca 
ttaaaagggg 
ttctagattc 
ctgtaatggc 
agcggcattc 
tttctttcga 
tcttatgttc 
caaacccagg 
gtagatatta 
ttgaggccta 
gttgtagatc 
gatgcgtcta 
tttgatgttt 
accacctcca 
tgctttcccg 
gtgacgttga 



tgacacaagg 
cgtccgcgag 
tcgcgatcct 
tccattaact 
gcttagagct 
tttatctgag 
gagagacccc 
taaacctcca 
agatggctta 
ccgttagaac 
caggaagaga 
attctaggtt 
tcttcgcttc 
ctagcaatcc 
cagccgctag 
atgatagggc 
aggcttagcc 
ttcactttaa 
agtttctatg 



atgcattctg 
ttctccgtta 
tggtctttat 
ttagaggtcg 
tctgtagttg 
aaagttgagc 
agttataggt 
aatcggaggt 
caatccgatc 
tctacttaat 
ttccttccgt 
cccgagtgcg 
gcttcttagc 
gacgactgct 
ggctgagata 
ttcttttgaa 
ttgagggtct 
tcgaacggtt 
ggcaagccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1139 



<210> 11 
<211> 1139 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of .Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 5 



<400> 11 

aattcggctt 
ttgcagaagg 
ggttcatccc 
ttgtgaggct 
cctgtttagt 
agtccgttgt 
ctttcttcga 
ctaggtcgca 
cggactctgc 
acgcctagca 
tttctggtgg 
gagtctttgt 
ggtttttacg 
tctacggata 
gagtcgctta 



ctgtaggggt 
aactatatgc 

taactattct 
ctctaacgta 
tgccactact 
ctcttccgat 
taaagtccct 
gtctgtcgtt 
ttctgaagag 
aacttattgc 
cgtcgcaagg 
ctgcgttacc 
ctttcctcaa 
cgcgtaatag 
acgcagttaa 



ggtgctactg 
aaattttcag 

gtcgtggttg 
ggtattaagg 
aactctatta 
cagtcgattg 
atttcagcag 
ggrcgtggta 
tccggttctg 
gtttagtgct 
tactgctttc 
ttcatccgtc 
cggtcctgtg 
ggtcattgag 
gcgtactgac 



ttgctttggt 
ctcccgccac 
cggatgccct 
atggttttca 
ttaaaaaggg 
ttctagattc 
ctgtaatggc 
agcggcattc 
tttctttcga 
tcttatgttc 
caaacccagg 
gtagatatta 
ttgaggccta 
gttgtagatc 
gatgcgtcta 



tgacacaagg 
cgtccgcgag 
tcgcgatcct 
tccattaact 
gcttagagct 
tttatctgag 
gagagacccc 
taaacctcca 
agatggctta 
ccgttagaac 
caggaagaga 
attctaggtt 
tcttcgtttc 
ctagcaatcc 
cagccgctag 



atgcattctg 60 
ttctccgtta 120 
tggtctttat 180 
ttagaggtcg 240 
tctgtagttg 300 
aaagttgagc 360 
agttataggt 420 
aatcggaggt 4 80 
caatccgatc 540 
tctacttaat 600 
ttccttccgt 660 
cccgagtgcg 720 
gcttcttagc 780 
gacgactgct 840 
ggctgagata 900 
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gataatttaa tagaatcaat ctctaagggg tttgatgttt atgatagggc ttcttttgaa 960 

gccgcgtttt cggtagtctg gtcagaggct accacctcca aggcttagcc ttgagggtct 1020 

tctgacggtg gtgcacacca tagtgcatag tgctttcccg ttcactttaa tcgaacggtt 1080 

tgctcattgg tttgcgaaaa cctctcgcgt gtgacgttga agtttctatg ggcaagccg 1139 



<210> 12 
<211> 1139 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 6 



<400> 12 

aattcggctt 

ttgcagaagg 

ggttcatccc 

ttgtgaggct 

cctgtctagt 

agtccgttgt 

ctttcttcga 

ctaggtcgca 

cggactctgc 

acgcctagca 

tttctagtgg 

gagtctttgt 

ggtttttacg 

tctacggata 

gaatcgctta 

gataatttaa 

gccgcgtttt 

tctgacggtg 

tgctcattgg 



ctgtaggggt 
aactatatgc 

taactattct 
ctctaacgta 
tgccactact 
ctcttccgat 
taaagtccct 
gtctgtcgtt 
ttctgaagag 
aacttattgc 
cgtcgcaagg 
ctgcgttacc 
ctttcctcaa 
cgcgtaatag 
acgcagttaa 
tagaatcaat 
cggtagtctg 
gtgcacacca 
tttgcgaaaa 



ggtgctactg 
aaattttcag 

gtcgtggttg 
ggtattaagg 
aactctatta 
cagtcgattg 
atttcagcgg 
ggtcgtggta 
tccagttctg 
gtttagtgct 
tactgctttc 
ttcatccgtt 
cggtcctgtg 
ggtcattgag 
gcgtactgac 
ctctaagggg 
gtcagaggct 
tagtgcatag 
cctctcgcgt 



ttgctttggt 
ctcccgccac 
cggatgccct 
atggttttca 
ttaaaaagga 
ttctagattc 
ctgraatggc 
agcggcattc 
tttctttcga 
tcttatgttc 
caaacccagg 
gtagatatta 
ttgaggccta 
gttgtagatc 
gatgcgtcta 
tttgatgttt 
accacctcca 
tgctttcccg 
gtgacgttga 



tgacacaagg 
cgtccgcgag 
tcgcgatcct 
tccattaact 
gcttagagct 
tttatctgag 
gagagacccc 
taaacctcca 
agatggctta 
ccgttagaac 
caggaagaga 
attctaggtt 
tcttcgtttc 
ctagcaatcc 
cagccgctag 
atgacagggc 
aggcttagcc 
ttcactttaa 
agtttctatg 



atgcattctg 
ttctctgtta 
tggtcrttat 
ttagaggtcg 
tctgtagttg 
aaagttgagc 
agttataggt 
aatcggaggt 
caatccgatc 
tttacttaat 
ttccttccgt 
cccgagtgcg 
gcttcttagc 
gacgactgct 
ggctgagata 
ttcttttgaa 
ttgagggtct 
tcgaacggtt 
ggcaagccg 



60 
120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1139 



<210> 13 
<211> 1138 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 7 

<400> 13 

aattcggctt ctgtaggggt ggtgctactg ttgctttggt tgacacaagg atgcattctg 60 
ttgcagaagg aactatatgc aaattttcag ctcccgccac cgtccgcgag ttctctgtta 120 
ggttcatccc taactattct gtcgtggttg cggatgccct tcgcgatcct tggtctttat 180 
ttgtgaggct ctctaacgta ggtattaagg atggttttca tccattaact ttagaggtcg 240 
cctgtctagt tgccactact aactctatta traaaaaggg gcttagagct tctgtagttg 300 
agtccgttgt ctcttccgat cagtcgattg ttctagattc tttatctgag aaagttgagc 360 
ctttcttcga taaagtccct atttcagcgg ctgtgatggc gagggacccc agttataggt 420 
ctaggtcgca gtctgtcgtt ggtcgtggta agcggcattc taaacctcca aatcggaggt 480 
cggactctgc ttctgaagag tccagttctg tttctttcga agatggctta caatccgatc 540 
acgcctagca aacttattgc gtttagtgct tcttatgttc ccgttagaac tttacttaat 600 
tttctagtgg cgtcgcaagg tactgctttc caa=cccagg caggaagaga ttccttccgt 660 
gagtctttgt ctgcgttacc ttcatccgtt gtagaratta attctaggtt cccgaatgcg 720 
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ggtttttacg 
tctacggata 
gagtcgctta 
gataatttaa 
gccgcgtttt 
tctgacggtg 
tgctcattgg 



ctttcctcaa 
cgcgtaatag 
acgcagttaa 
tagaatcaat 
cggtagtctg 
gtgcacacca 
tttgcgaaaa 



cggtcctgtg 
ggtcattgag 
gcgtactgac 
ctctaagggg 
gtcagaggct 
tagtgcatag 
ctctcgcgtg 











(T"l~ i~ n'\~ "f" (~T 3 "t" r"" 
y LLyuLydL-L^ 




ydL-yaL-uyi—L 


8 4 0 


gatgcgtcta 


cagccgctag 


ggctgagata 


900 


tttgatgttt 


atgatagggc 


ttcttttgaa 


960 


accacctcca 


aggcttagcc 


ttgagggtct 


1020 


tgctttcccg 


ttcactttaa 


tcgaacggtt 


1080 


tgacgttgaa 


gtttctatgg 


gcaagccg 


1138 



<210> 14 
<211> 1139 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 8 



<400> 14 

aattcggctt 

ttgcagaagg 

ggttcatccc 

ttgtgaggct 

cctgtctagt 

agtccgttgt 

ctttcttcga 

ctaggtcgca 

cggactctgc 

acgcctagca 

tttctagtgg 

gagtctttgt 

ggtttttacg 

tctacggata 

gaatcgctta 

gataatttaa 

gccgcgtttt 

tctgacggtg 

tgctcattgg 



ctgtaggggt 
aactaratgc 
taactattct 
ctctaacgta 
tgccactact 
ctcttccgat 
taaagrccct 
gtctgccgtt 
ttctgaagag 
aacttattgc 
cgtcgcaagg 
ctgcgtcacc 
ctttcctcaa 
cgcgtaatag 
acgcagttaa 
tagaar caat 
cggtagtctg 
gtgcacacca 
tttgcgaaaa 



ggtgctactg 
aaattttcag 
gtcgtggttg 
ggtattaagg 
aactctatta 
cagtcgattg 
atttcagcgg 
ggtcgtggta 
tccagttctg 
gtttagtgct 
tactgctttc 
ttcatccgtt 
cggtcctgtg 
ggtcattgag 
gcgtactgac 
ctctaagggg 
gtcagaggct 
tagtgcatag 
cctctcgcgt 



ttgctttggt 
ctcccgccac 
cggatgccct 
atggttttca 
ttaaaaaggg 
ttctagattc 
ctgtaatggc 
agcggcattc 
tttctttcga 
tcttatgctc 
caaatccagg 
gtagatatta 
ttgaggccta 
gttgtagatc 
gatgcgtcta 
tttgatgttt 
accacctcca 
tgctttcccg 
gtgacgttga 



tgacacaagg 
cgtccgcgag 
tcgcgatcct 
tccattaact 
gcttagagct 
tttatctgag 
gagagacccc 
taaacctcca 
agatggctta 
ccgttagaac 
caggaagaga 
attctaggtt 
tcttcgtttc 
ctagcaatcc 
cagccgctag 
atgatagggc 
aggcttagcc 
ttcactttaa 
agtttctatg 



atgcactctg 
ttctctgtta 
tggtctttat 
ttagaggtcg 
tctgtagttg 
aaagttgagc 
agttataggt 
aatcggaggt 
caatccgatc 
tttacttaat 
ttccttccgt 
cccgagtgcg 
gcttcttagc 
gacgactgct 
ggctgagata 
ttcttttgaa 
ttgagggtct 
tcgaacggtt 
ggcaagccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

650 

720 

780 

840 

900 

960 

1020 

1080 

1139 



<210> 15 
<211> 1139 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 9 



<400> 15 

aattcggctt 

ttgcagaagg 

ggttcatccc 

ttgtgaggct 

cctgtctagt 

agtccgttgt 

ctttcttcga 

ctaggtcaca 

cggactctgc 



ctgtaggggt 
aactatatgc 
taactattct 
ctctaacgta 
tgccactact 
ctcttccgat 
taaagtccct 
gtctgtcgtt 
ttctgaagag 



ggtgctactg 
aaattttcag 
gtcgtggctg 
ggcattaagg 
aactctatta 
cagtcgattg 
atttcagcgg 
ggtcgtggta 
tccagttctg 



ttgctttggt 
ctcccgccac 
cggatgccct 
atggttttca 
ttaaaaaggg 
ttctagattc 
ctgtaatggc 
agcggcattc 
tttcttttga 



tgacacaagg 
cgtccgcgag 
tcgcgatcct 
tccattaact 
gcttagagct 
tttgtctgag 
tagagacccc 
taaacctcca 
agatggctta 



atgcattctg 60 
ttctctgtta 120 
tggtctttat 180 
ttagaggtcg 240 
tctgtagttg 300 
aaagttgagc 360 
agttataggt 420 
aatcggaggt 4 80 
caatccgatc 540 
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acgcctagca 
tttctagtgg 
gagtctttgt 
ggtttttacg 
tctacggata 
gagtcgctta 
gataatttaa 
gccgcgtttt 
tctgacggtg 
tgctcattgg 



aacttattgc 
cgtcgcaagg 
ctgcgttacc 
ctttcctcaa 
cgcgtaatag 
acgcagttaa 
tagaatcaat 
cggtagtctg 
gtgcacacca 
tttgcgaaaa 



gtttagtgct 
tactgctttt 
ttcatccgtt 
cggtcctgtg 
ggtcattgag 
gcgtactgac 
ctctaagggg 
gtcagaggct 
tagtgcatag 
cctctcgcgt 



tcatatgttc 
caaacccagg 
gtagatatta 
ttgaggccta 
gttgtagatc 
gatgcgtcta 
tttgatgttt 
accacctcca 
tgttttcccg 
gtgacgttga 



ccgttagaac 
caggaagaga 
attctaggtt 
tcttcgtttc 
ctagcaatcc 
cagccgctag 
atgatagggc 
aggcttagcc 
ttcactttaa 
agtttctatg 



tttacttaat 600 
ttccttccgt 660 
cccgagtgcg 720 
gcttcttagc 780 
gacgactgct 840 
ggctgagata 900 
ttcctttgaa 960 
ttgagggtct 1020 
tcgaacggtt 1080 
ggcaagccg 1139 



<210> 16 
<211> 1139 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV isolate 10. 



<400> 16 

aattcggctt 

ttgcagaagg 

ggttcatccc 

ttgtgaggct 

cctgtctagt 

agtccgttgt 

ctttcttcga 

ctaggtcaca 

cggactctgc 

acgcctagca 

tttctagtgg 

gagtctttgt 

ggtttttacg 

tctacggata 

gagtcgctta 

gataatttaa 

gccgcgtttt 

tctgacggtg 

tgctcattgg 



ctgtaggggt 
aactatatgc 
taactattct 
ctctaacgta 
tgccactact 
ctcttccgat 
taaagtccct 
gtctgtcgtt 
ttctgaagag 
aacttattgc 
cgtcgcaagg 
ctgcgttacc 
ctttcctcaa 
cgcgtaatag 
acgcagttaa 
tagaatcaat 
cggtagtctg 
gtgcacacca 
tttgcgaaaa 



ggtgctactg 
aaattttcag 
gtcgtggctg 
ggcattaagg 
aactctatta 
cagtcgattg 
atttcagcgg 
ggtcgtggta 
tccagttctg 
gtttagtgct 
tactgctttt 
ttcatccgtt 
cggtcctgtg 
ggtcattgag 
gcgtactgac 
ctctaagggg 
gtcagaggct 
tagtgcatag 
cctctcgcgt 



ttgctttggt 
ctcccgccac 
cggatgccct 
atggttttca 
ttaaaaaggg 
ttctagattc 
ctgtaatggc 
agcggcattc 
tttcttttga 
tcatatgttc 
caaacccagg 
gtagatatta 
ttgaggccta 
gttgtagatc 
gatgcgtcta 
tttgatgttt 
accacctcca 
tgctttcccg 
gtgacgttga 



tgacacaagg 
cgtccgcgag 
tcgcgatcct 
tccattaact 
gcttagagct 
tttgtctgag 
tagagacccc 
taaacctcca 
agatggctta 
ccgttagaac 
caggaagaga 
attctaggtt 
tcttcgtttc 
ctagcaatcc 
cagccgctag 
atgatagggc 
aggcttagcc 
ttcactttaa 
agtttctatg 



atgcattctg 
ttctctgtta 

tggtctttat 
ttagaggtcg 
tctgtagttg 
aaagttgagc 
agttataggt 
aatcggaggt 
caatccgatc 
tttacttaat 
ttccttccgt 
cccgagtgcg 
gcttcttagc 
gacgactgct 
ggctgagata 
ttcctttgaa 
ttgagggtct 
tcgaacggtt 
ggcaagccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1139 



<210> 17 
<211> 3429 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: nucleotide 
sequence corresponding to 129 kD replicase of 

CGMMV strain SH 



<400> 17 

atggcaaaca 

aatctcgtta 

catcaagaca 

cttgtaactg 

cactcccttg 

tacggctcac 



ttaatgaaca 
gccaattggc 
gacgcccgaa 
acgcgtatcc 
cgggtggtct 
cttgttatga 



aatcaacaac 
gtcaaaaagg 
aatgaatttt 
ggagttttcg 
gaggcttctt 
catcggcggt 



caacgtgacg 
gtgtatgacg 
tctcgtgtgg 
attagcttta 
gaattggaat 
aactatacgc 



ccgcggctag 
aggctgttcg 
tcagcacaga 
ccgccaccaa 
atatgatgat 
agcacttgtt 



cgggagaaac 60 
ctcgttggat 120 
gcacaccagg 180 
gaactctgta 240 
gcaggtgccc 300 
caaaggtaga 360 
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tcatatgtgc attgctgcaa tccgtgccra gatcttaagg atgttgcgag gaatgtgatg 420 

tacaacgata tgattacgca acatgtacag aggcacaagg gatcttgcgg gtgcagacct 480 

cttccaactt tccagataga tgcattcagg aggtacgata gttctccctg tgcggccacc 540 

tgttcagacg ttttccaaga gtgttcctat gattttggga gtggtaggga taatcatgca 600 

gtctcgttgc attcaatcta cgatatccct tattcttcga tcggacctgc tcttcatagg 660 

aagaatgtgc gagtttgtta tgcagccttt catttctcgg aggcattgct tttaggttcg 720 

cctgtaggta atttaaatag tattggggct cagtttaggg tcgatggtga tgatgtgcat 780 

tttcttttta gtgaagagtc tactttgcat tatactcata gtttagaaaa tatcaagtca 840 

atcgtgatgc gtacttactt tcctgctgat gataggtttg tatatattaa ggagttcatg 900 

gttaagcgtg tggatacttt tttctttagg ttggtcagag cagatacaca catgcttcat 960 

aaatctgtgg ggcactattc gaaatcgaag tctgagtact tcgcgctgaa tacccctccg 1020 

atcttccaag ataaagccac gttttctgtg tggtttcctg aagcgaagaa ggtgttgata 1080 

cccaagtttg aactttcgag attcctttct gggaatgtga aaatctctag gatgcrtgtc 1140 

gatgctgatt tcgtccatac cattattaat cacattagca cgtatgataa caaggcctta 1200 

gtgtggaaga atgttcagtc ctttgtggaa tccatacgtt caagagtaat tgtaaacgga 1260 

gtttccgtga aatctgagtg gaacgtaccg gttgatcagc tcactgatat ctcgttctcg 1320 

atattccctc tcgtgaaggt taggaaggta cagatcgagt taatgtctga taaagttgta 1380 

atcgaggcga ggggtttgct tcggaggttc gcagacagtc ttaaatctgc cgtagaagga 1440 

ctaggtgatt gcgtctatga tgctctagtt caaaccggct ggtttgacac ctctagcgac 1500 

gaactgaaag tattgctacc tgaaccgttt atgacctttt cggattatct tgaagggatg 1560 

tacgaggcag atgcaaagat cgagagagag agtgtctctg agttgctcgc ttccggtgat 1620 

gatttgttca agaaaatcga tgagataaga aacaattaca gtggagtcga atttgargta 1680 

gagaaattcc aagaattttg caaggaacrg aatgttaatc ctatgctaat tggccatgtc 1740 

atcgaagcta ttttttcgca gaaggctggg gtaacagtaa cgggtctggg cacgctctct 1800 

cctgagatgg gcgcttctgt tgcgttatcc agtacctcrg tagatacatg tgaagatatg 1860 

gatgtaactg aagatatgga ggatatagtg ttgatggcgg acaagagtca ttcttacatg 1920 

tcccctgaaa tggcgagatg ggctgatgci: aaatatggca acaataaagg ggctctagtc 1980 

gagtacaaag tcggaacctc gatgacttra cctgccacct gggcagagaa agttaaggct 2040 

gtcttaccgt tgtcggggat ctgtgtgagg aaaccccaat tttcgaagcc gcttgatgag 2100 

gaagatgact tgaggttatc aaacatgaat ttctttaagg tgagcgatct aaagttgaag 2160 

aagactatca ctccagtcgt ttacactggg accattcgag agaggcaaat gaagaattat 2220 

attgattact tatcggcctc tcttggttcc acgctgggta atctggagag aatcgtgcgg 2280 

agtgattgga acggtactga ggagagtatg caaacgttcg ggttgtatga ctgcgaaaag 2340 

tgcaagtggt tattgttgcc agccgagaag aagcacgcat gggccgtggt tctggcaagt 2400 

gacgatacca ctcgcataat cttcctttca tatgacgaat ctggttctcc tataattgat 2460 

aagaaaaact ggaagcgatt tgctgcctgr tccgagacca aagtctatag tgtaartcgt 2520 

agcttagagg ttctaaataa ggaagcaata gtcgaccccg gggttcacat aacattagtt 2580 

gacggagtgc cgggttgtgg aaagaccgcc gagattatag cgagggtcaa ttggaaaact 2640 

gatctagtat tgactcccgg aagggaggca gctgctatga ttaggcggag agcctgcgcc 2700 

ctgcacaagt cacctgtggc aaccaatgac aacgtcagaa ctttcgattc ttttgrgatg 2760 

aataggaaaa tcttcaagtt tgacgctgtc tatgttgacg agggtctgat ggtccacacg 2820 

ggattactta attttgcgtt aaagatctca ggttgtaaaa aagccttcgt ctttggtgat 2880 

gctaagcaaa tcccgtttat aaacagagtc atgaatttcg attatcctaa ggagttaaga 2940 

actttaatag tcgataatgt agagcgtagg tatgtcaccc ataggtgtcc tagagatgtc 3000 

actagttttc ttaatactat ctataaagcc gctgtcgcta ctactagtcc ggttgtacat 3060 

tctgtgaagg caattaaagt gtcaggggcc ggtattctga ggcctgagtt gacaaagatc 3120 

aaaggaaaga taataacgtt tactcaatct gataagcagt ccttgatcaa gagtgggtac 3180 

aatgatgtga atactgtgca tgaaattcag ggagaaacct ttgaggagac ggcagttgtg 3240 

cgtgccaccc cgactccaat aggtttgatt gcccgtgatt caccacatgt actagtggcc 3300 

ttaactaggc acactaaggc aatggtgtat tatactgttg tattcgatgc agttacaagt 3360 

ataatagcgg atgtggaaaa ggtcgatcag tcgatcttga ccatgtttgc taccactgtg 3420 
cctaccaaa 3429 



<210> 18 

<211> 1143 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amino acid 



wo 01/09300 



22 



PCT/NLOO/00534 



sequence corresponding to 129 kD replicase of 
CGMMV strain SH 

<400> 18 

Met Ala Asn lie Asn Glu Gin lie Asn Asn Gin Arg Asp Ala Ala Ala 
15 10 15 

Ser Gly Arg Asn Asn Leu Val Ser Gin Leu Ala Ser Lys Arg Val Tyr 
20 25 30 

Asp Glu Ala Val Arg Ser Leu Asp His Gin Asp Arg Arg Pro Lys Met 
35 40 45 

Asn Phe Ser Arg Val Val Ser Thr Glu His Thr Arg Leu Val Thr Asp 
50 55 60 

Ala Tyr Pro Glu Phe Ser lie Ser Phe Thr Ala Thr Lys Asn Ser Val 
65 70 75 80 

His Ser Leu Ala Gly Gly Leu Arg Leu Leu Glu Leu Glu Tyr Met Met 
85 90 95 

Met Gin Val Pro Tyr Gly Ser Pro Cys Tyr Asp lie Gly Gly Asn Tyr 
100 105 ' 110 

Thr Gin His Leu Phe Lys Gly Arg Ser Tyr Val His Cys Cys Asn Pro 
115 120 125 

Cys Leu Asp Leu Lys Asp Val Ala Arg Asn Val Met Tyr Asn Asp Met 
130 " 135 140 

lie Thr Gin His Val Gin Arg His Lys Gly Ser Cys Gly Cys Arg Pro 
145 150 155 160 

Leu Pro Thr Phe Gin lie Asp Ala Phe Arg Arg Tyr Asp Ser Ser Pro 
165 170 175 

Cys Ala Val Thr Cys Ser Asp Val Phe Gin Glu Cys Ser Tyr Asp Phe 
180 185 190 

Gly Ser Gly Arg Asp Asn His Ala Val Ser Leu His Ser lie Tyr Asp 
195 200 205 

lie Pro Tyr Ser Ser lie Gly Pro Ala Leu His Arg Lys Asn Val Arg 

210 215 220 

Val Cys Tyr Ala Ala Phe His Phe Ser Glu Ala Leu Leu Leu Gly Ser 
225 ' 230 235 240 

Pro Val Gly Asn Leu Asn Ser lie Gly Ala Gin Phe Arq Val Asp Gly 
245 250 ' 255 

Asp Asp Val His Phe Leu Phe Ser Glu Glu Ser Thr Leu His Tyr Thr 
260 265 270 

His Ser Leu Glu Asn lie Lys Leu lie Val Met Arg Thr Tyr Phe Pro 
275 280 285 

Ala Asp Asp Arg Phe Val Tyr lie Lys Glu Phe Met Val Lys Arg Val 
290 295 300 
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Asp Thr Phe Phe 
305 

Lys Ser Val Gly 



Asn Thr Pro Pro 

340 

Pro Glu Ala Lys 

355 

Leu Ser Gly Asn 
370 

Val His Thr He 
385 

Val Trp Lys Asn 



He Val Asn Gly 
420 

Gin Leu Thr Asp 
435 

Lys Val Gin He 
450 

Gly Leu Leu Arg 
465 

Leu Gly Asp Cys 



Thr Ser Ser Asp 
500 

Phe Ser Asp Tyr 
515 

Arg Glu Ser Val 
530 

Lys He Asp Glu 
545 

Glu Lys Phe Gin 



He Gly His Val 

580 

Val Thr Gly Leu 
595 

Leu Ser Ser Thr 
610 



Phe Arg Leu Val 
310 

His Tyr Ser Lys 
325 

He Phe Gin Asp 



Lys Val Leu He 

360 

Val Lys He Ser 
375 

He Asn His He 
390 

Val Gin Ser Phe 
405 

Val Ser Val Lys 



He Ser Phe Ser 
440 

Glu Leu Met Ser 
455 

Arg Phe Ala Asp 
470 

Val Tyr Asp Ala 
485 

Glu Leu Lys Val 



Leu Glu Gly Met 
520 

Ser Glu Leu Leu 
535 

He Arg Asn Asn 
550 

Glu Phe Cys Lys 
565 

He Glu Ala He 



Gly Thr Leu Ser 
600 

Ser Val Asp Thr 
615 



Arg Ala Asp Thr 
315 

Ser Lys Ser Glu 
330 

Lys Ala Thr Phe 
345 

Pro Lys Phe Glu 



Arg Met Leu Val 
380 

Ser Thr Tyr Asp 
395 

Val Glu Ser He 
410 

Ser Glu Trp Asn 
425 

He Phe Pro Leu 



Asp Lys Val Val 
460 

Ser Leu Lys Ser 
475 

Leu Val Gin Thr 
490 

Leu Leu Pro Glu 
505 

Tyr Glu Ala Asp 



Ala Ser Gly Asp 
540 

Tyr Ser Gly Val 
555 

Glu Leu Asn Val 
570 

Phe Ser Gin Lys 

585 

Pro Glu Met Gly 



Cys Glu Asp Met 
620 



His Met Leu His 
320 

Tyr Phe Ala Leu 
335 

Ser Val Trp Phe 
350 

Leu Ser Arg Phe 

365 

Asp Ala Asp Phe 



Asn Lys Ala Leu 
400 

Arg Ser Arg Val 
415 

Val Pro Val Asp 
430 

Val Lys Val Arg 
445 

He Glu Ala Arg 



Ala Val Glu Gly 

480 

Gly Trp Phe Asp 
495 

Pro Phe Met Thr 
510 

Ala Lys He Glu 
525 

Asp Leu Phe Lys 



Glu Phe Asp Val 
560 

Asn Pro Met Leu 
575 

Ala Gly Val Thr 
590 

Ala Ser Val Ala 
605 

Asp Val Thr Glu 



Asp Met Glu Asp He Val Leu Met Ala Asp Lys Ser His Ser Tyr Met 
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625 



630 



635 



640 



Ser Pro Glu Met Ala Arg Trp Ala Asp Val Lys Tyr Gly Asn Asn Lys 
645 650 655 

Gly Ala Leu Val Glu Tyr Lys Val Gly Thr Ser Met Thr Leu Pro Ala 
660 665 670 

Thr Trp Ala Glu Lys Val Lys Ala Val Leu Pro Leu Ser Gly He Cys 
675 680 685 

Val Arg Lys Pro Gin Phe Ser Lys Pro Leu Asp Glu Glu Asp Asp Leu 



Arg Leu Ser Asn Met Asn Phe Phe Lys Val Ser Asp Leu Lys Leu Lys 

705 710 715 720 

Lys Thr He Thr Pro Val Val Tyr Thr Gly Thr He Arg Glu Arg Gin 
725 730 735 

Met Lys Asn Tyr He Asp Tyr Leu Ser Ala Ser Leu Gly Ser Thr Leu 
740 745 750 

Gly Asn Leu Glu Arg He Val Arg Ser Asp Trp Asn Gly Thr Glu Glu 
755 760 765 

Ser Met Gin Thr Phe Gly Leu Tyr Asp Cys Glu Lys Cys Lys Trp Leu 
770 775 780 

Leu Leu Pro Ala Glu Lys Lys His Ala Trp Ala Val Val Leu Ala Ser 
785 790 795 800 

Asp Asp Thr Thr Arg He He Phe Leu Ser Tyr Asp Glu Ser Gly Ser 
805 810 815 

Pro He He Asp Lys Lys Asn Trp Lys Arg Phe Ala Val Cys Ser Glu 

820 825 830 

Thr Lys Val Tyr Ser Val He Arg Ser Leu Glu Val Leu Asn Lys Glu 
835 840 845 

Ala He Val Asp Pro Gly Val His He Thr Leu Val Asp Gly Val Pro 
850 855 860 

Gly Cys Gly Lys Thr Ala Glu He He Ala Arg Val Asn Trp Lys Thr 
865 870 875 880 

Asp Leu Val Leu Thr Pro Gly Arg Glu Ala Ala Ala Met He Arg Arg 
885 890 895 

Arg Ala Cys Ala Leu His Lys Ser Pro Val Ala Thr Asn Asp Asn Val 
900 905 910 

Arg Thr Phe Asp Ser Phe Val Met Asn Arg Lys He Phe Lys Phe Asp 
915 920 925 

Ala Val Tyr Val Asp Glu Gly Leu Met Val His Thr Gly Leu Leu Asn 

930 935 940 



690 



695 



700 



Phe Ala Leu Lys He Ser Gly Cys Lys Lys Ala Phe Val Phe Gly Asp 
945 950 955 960 
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Ala Lys Gin lie Pro Phe lie Asn Arg Val Met Asn Phe Asp Tyr Pro 
965 970 975 

Lys Glu Leu Arg Thr Leu lie Val Asp Asn Val Glu Arg Arg Tyr Val 
980 985 990 

Thr His Arg Cys Pro Arg Asp Val Thr Ser Phe Leu Asn Thr lie Tyr 
995 1000 1005 

Lys Ala Ala Val Ala Thr Thr Ser Pro Val Val His Ser Val Lys Ala 
1010 1015 1020 

lie Lys Val Ser Gly Ala Gly lie Leu Arg Pro Glu Leu Thr Lys lie 
1025 1030 1035 1040 

Lys Gly Lys lie lie Thr Phe Thr Gin Ser Asp Lys Gin Ser Leu lie 
1045 1050 1055 

Lys Ser Gly Tyr Asn Asp Val Asn Thr Val His Glu He Gin Gly Glu 
1060 1065 1070 

Thr Phe Glu Glu Thr Ala Val Val Arg Ala Thr Pro Thr Pro He Gly 
1075 1080 1085 

Leu He Ala Arg Asp Ser Pro His Val Leu Val Ala Leu Thr Arg His 
1090 1095 1100 

Thr Lys Ala Met Val Tyr Tyr Thr Val Val Phe Asp Ala Val Thr Ser 
1105 1110 1115 1120 

He He Ala Asp Val Glu Lys Val Asp Gin Ser He Leu Thr Met Phe 
1125 ' 1130 1135 



Ala Thr Thr Val Pro Thr Lys 
1140 



<210> 19 
<211> 1503 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to 57 kD protein of CGMMV 
strain SH 



<400> 19 

atgcagaatt 

tatacagaca 

gatttcgatg 

ctaaccttga 

ctgatccccg 

cttgtagcta 

actaacatgt 

ttgcttgatc 

tcgtgtcaac 

gcctttgata 

attcagtctg 



cgctgtatgt 

tgcaggagtt 
ccgtaaccat 
gtaatttaga 
ttttgcgtac 
tgataaagag 
cgatttctat 
acttagattg 
caacctcagc 
cttatatgca 
aatatccggc 



ccatcgtaat 

ctacgataga 
gcggttgagg 
tccggtaccc 
ggcctgtgaa 
gaatatgaat 
agtagataac 
tgttagggct 
ggttggccag 
tatgattaag 
cttgcaaact 



attttcctcc 
tgccttcctg 
gacaacgaat 
gctttgatta 
aggccgcgca 
actcctgatt 
ttcttttctt 
agttccattc 
ttagctaatt 
aggcaaccca 
attgtttatc 



ctgttagtaa 

ggaattcctt 
ttaacttaca 
agaatgaagc 
ttccgggtct 
tagctgggac 
cttttgttag 
aaagtttttc 
tcaatttcat 
agagtcggtt 
accctaaagt 



aacggggttt 

cgtactaaat 
accttgtagg 
gcagaatttt 
tcttgagaat 
cgtagatata 
ggacgaggtt 
tgattggttt 
agatttgcct 
agatacttcg 
ggtaaatgca 



60 

120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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grttttggtc 

aagtttttct 
tcttcccatr 
tccgatttcc 
trggcttgga 
ataaagacgc 
acctttatta 
agtttttgtg 
caggctactg 
tacttctgcg 
rtaaaattaa 
tttcgtatat 
ctcgacgatg 
tgtttgtgta 
aag 



cggttttcaa 
tttacactag 
ctgattatga 
acttctctat 
tgtggtctat 
tcatttacta 
tcgcagcgtg 
gtgatgattc 
ccaaccttgt 
ggaagtatat 
ttagtaaatt 
ctctcctcga 
caatccacga 
agtatttgag 



gtatttaacc 
gaaaaaacca 
gattcttgag 
tgagatggca 
gggtcacaaa 
tcaacggaag 
tgtggctagt 
gctgatctac 
ttggaatttt 
aattcaccat 
aggtaataag 
cgttgctcat 
attatttcct 
tgataagcgc 



accaagtttc 
gaagatctgc 
cttgatgttt 
atttgggaaa 
agaactatac 
tctggtgatg 
atgttgccgt 
cttcctaagg 
gaggcgaaac 
gccaacggct 
agtcttgtag 
agtttgttta 
aatgctgggg 
cttttccgta 



ttagtatggt 
aggaattttt 
ctaaatatga 
aattagggct 
tgcaagattt 
taactacttt 
tagataagtg 
gtttggagta 
ttttccgaaa 
gtattgttta 
ggtatgagca 
atggtgctta 
gttgcagttt 
gtctttacat 



agatagttct 
ctcagatctc 
caagtcgcaa 
tgacgatatt 
ccaagccggg 
tataggtaat 
ttttaaagct 
tcctgatata 
gaagtatggt 
ccctgaccct 
tgttgaggag 
tttccattta 
tgtaattaat 
agatgtctct 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1503 



<210> 20 
<211> 501 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: amino acid 

sequence corresponding to 57 kD protein of CGMMV 
strain SH 

<400> 20 

Met Gin Asn Ser Leu Tyr Val His Arg Asn lie Phe Leu Pro Val Ser 
15 10 15 

Lys Thr Gly Phe Tyr Thr Asp Met Gin Glu Phe Tyr Asp Arg Cys Leu 
20 25 30 

Pro Gly Asn Ser Phe Val Leu Asn Asp Phe Asp Ala Val Thr Met Arg 
35 40 45 

Leu Arg Asp Asn Glu Phe Asn Leu Gin Pro Cys Arg Leu Thr Leu Ser 
50 55 60 

Asn Leu Asp Pro Val Pro Ala Leu lie Lys Asn Glu Ala Gin Asn Phe 
65 .70 75 80 

Leu lie Pro Val Leu Arg Thr Ala Cys Glu Arg Pro Arg lie Pro Gly 
85 90 95 

Leu Leu Glu Asn Leu Val Ala Met lie Lys Arg Asn Met Asn Thr Pro 
100 105 110 

Asp Leu Ala Gly Thr Val Asp lie Thr Asn Met Ser lie Ser lie Val 
115 120 125 

Asp Asn Phe Phe Ser Ser Phe Val Arg Asp Glu Val Leu Leu Asp His 
130 135 ' 140 

Leu Asp Cys Val Arg Ala Ser Ser lie Gin Ser Phe Ser Asp Trp Phe 
145 150 155 160 



Ser Cys Gin Pro Thr Ser Ala Val Gly Gin Leu Ala Asn Phe Asn Phe 
165 170 175 
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lie Asp Leu Pro 
180 

Pro Lys Ser Arg 
195 

Gin Thr He Val 
210 

Val Phe Lys Tyr 
225 

Lys Phe Phe Phe 



Phe Ser Asp Leu 
260 

Val Ser Lys Tyr 
275 

Met Ala He Trp 
290 

Trp Ser Met Gly 

305 

He Lys Thr Leu 



Phe He Gly Asn 

340 

Pro Leu Asp Lys 
355 

He Tyr Leu Pro 
370 

Asn Leu Val Trp 
385 

Tyr Phe Cys Gly 



Tyr Pro Asp Pro 
420 

Val Gly Tyr Glu 

435 

Ala His Ser Leu 

450 

He His Glu Leu 
465 

Cys Leu Cys Lys 



Ala Phe Asp Thr 



Leu Asp Thr Ser 
200 

Tyr His Pro Lys 
215 

Leu Thr Thr Lys 

230 

Tyr Thr Arg Lys 
245 

Ser Ser His Ser 



Asp Lys Ser Gin 
280 

Glu Lys Leu Gly 
295 

His Lys Arg Thr 
310 

He Tyr Tyr Gin 
325 

Thr Phe He He 



Cys Phe Lys Ala 
360 

Lys Gly Leu Glu 
375 

Asn Phe Glu Ala 
390 

Lys Tyr He He 
405 

Leu Lys Leu He 



His Val Glu Glu 
440 

Phe Asn Gly Ala 
455 

Phe Pro Asn Ala 
470 

Tyr Leu Ser Asp 
485 



Tyr Met His Met 
185 

He Gin Ser Glu 



Val Val Asn Ala 
220 

Phe Leu Ser Met 

235 

Lys Pro Glu Asp 
250 

Asp Tyr Glu He 
265 

Ser Asp Phe His 



Leu Asp Asp He 
300 

He Leu Gin Asp 
315 

Arg Lys Ser Gly 
330 

Ala Ala Cys Val 
345 

Ser Phe Cys Gly 



Tyr Pro Asp He 
380 

Lys Leu Phe Arg 
395 

His His Ala Asn 
410 

Ser Lys Leu Gly 
425 

Phe Arg He Ser 



Tyr Phe His Leu 

460 

Gly Gly Cys Ser 
475 

Lys Arg Leu Phe 
490 



He Lys Arg Gin 
ISO 

Tyr Pro Ala Leu 
205 

Val Phe Gly Pro 



Val Asp Ser Ser 

240 

Leu Gin Glu Phe 
255 

Leu Glu Leu Asp 
270 

Phe Ser He. Glu 
285 

Leu Ala Trp Met 



Phe Gin Ala Gly 
320 

Asp Val Thr Thr 
335 

Ala Ser Met Lsu 

350 

Asp Asp Ser Leu 
365 

Gin Ala Thr Ala 



Lys Lys Tyr Gly 
400 

Gly Cys He Val 
415 

Asn Lys Ser Leu 
430 

Leu Leu Asp Val 
445 

Leu Asp Asp Ala 



Phe Val He Asn 
480 

Arg Ser Leu Tyr 
495 



He Asp Val Ser Lys 
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500 



<210> 21 
<211> 4932 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to 18 6 kD protein of CGMMV 
strain SH 

<400> 21 

atggcaaaca ttaatgaaca aatcaacaac caacgtgacg ccgcggctag cgggagaaac 60 

aatctcgtta gccaattggc gtcaaaaagg gtgtatgacg aggctgttcg ctcgttggat 120 

catcaagaca gacgcccgaa aatgaatttt tctcgtgtgg tcagcacaga gcacaccagg 180 

cttgtaactg acgcgtatcc ggagttttcg attagcttta ccgccaccaa gaactctgta 240 

cactcccttg cgggtggtct gaggcttctc gaattggaat anatgatgat gcaggtgccc 300 

tacggctcac cttgttatga catcggcggi: aactatacgc agcacttgtt caaaggtaga 360 

tcatatgtgc attgctgcaa tccgtgccta gatcttaagg atgttgcgag gaatgtgatg 420 

tacaacgata tgattacgca acatgtacag aggcacaagg gatcttgcgg gtgcagacct 480 

cttccaactt tccagataga tgcattcagg aggtacgata gttctccctg tgcggtcacc 540 

tgttcagacg ttttccaaga gtgttcctat gattttggga gtggtaggga taatcatgca 600 

gtctcgttgc attcaatcta cgatatccct tattcttcga tcggacctgc tcttcatagg 660 

aagaatgtgc gagtttgtta tgcagccttt catttctcgg aggcattgct tttaggttcg 720 

cctgtaggta atttaaatag tattggggct cagtttaggg tcgatggtga tgatgtgcat 780 

tttcttttta gtgaagagtc tactttgcat tatactcata gtttagaaaa tatcaagtta 840 

atcgtgatgc gtacttactt tcctgctgat gataggtttg tatatattaa ggagttcatg 900 

gttaagcgtg tggatacttt tttctttagg ttggtcagag cagatacaca catgcttcat 960 

aaatctgtgg ggcactattc gaaatcgaag tctgagtact tcgcgctgaa tacccctccg 1020 

atcttccaag ataaagccac grtrtctgtc tgqttrcc-g aagcgaagaa ggtgttgata 1080 

cccaagtttg aactttcgag attcctttct gggaatgtga aaatctctag gatgcttgtc 1140 

gatgctgatt tcgtccatac cattattaat cacattagca cgtatgataa caaggcctta 1200 

gtgtggaaga atgttcagtc ctttgtggaa tccatacgtt caagagtaat tgtaaacgga 1260 

gtttccgtga aatctgagtg gaacgtaccg gttgatcagc rcactgatat ctcgttctcg 1320 

atattccctc tcgtgaaggt taggaaggta cagatcgagt raatgtctga taaagttgta 1380 

atcgaggcga ggggtttgct tcggaggttc gcagacagtc traaatctgc cgtagaagga 1440 

ctaggtgatt gcgtctatga tgctctagtt caaaccggct ggrttgacac ctctagcgac 1500 

gaactgaaag tattgctacc tgaaccgttt atgacctttt cggattatct tgaagggatg 1560 

tacgaggcag atgcaaagat cgagagagag agtgtctctg agttgcccgc ttccggtgat 1620 

gatttgttca agaaaatcga tgagataaga aacaattaca gtggagtcga atttgatgta 1680 

gagaaattcc aagaattttg caaggaactg aatgttaatc cratgctaat tggccatgtc 1740 

atcgaagcta ttttttcgca gaaggctggg gtaacagtaa cgggtctggg cacgctctct 1800 

cctgagatgg gcgcttctgt tgcgttatcc agtacctctg tagatacatg tgaagatatg 1860 

gatgtaactg aagatatgga ggatatagtg ttgatggcgg acaagagtca ttcttacatg 1920 

tcccctgaaa tggcgagatg ggctgatgtt aaatatggca acaataaagg ggctctagtc 1980 

gagtacaaag tcggaacctc gatgacttta cctgccacct gggcagagaa agttaaggct 2040 

gtcttaccgt tgtcggggat ctgtgtgagg aaaccccaat tttcgaagcc gcttgatgag 2100 

gaagatgact tgaggttatc aaacatgaat ttctttaagg tgagcgatct aaagttgaag 2160 

aagactatca ctccagtcgt ttacactggg accattcgag agaggcaaat gaagaattat 2220 

attgattact tatcggcctc tcttggttcc acgctgggta atctggagag aatcgtgcgg 2280 

agtgattgga atggtactga ggagagtatg caaacgttcg ggttgtatga ctgcgaaaag 2340 

tgcaagtggt tattgttgcc agccgagaag aagcacgcat gggccgtggt tctggcaagt 2400 

gacgatacca ctcgcataat cttcctttca tatgacgaat crggttctcc tataattgat 2460 

aagaaaaact ggaagcgatt tgctgtctgt tccgagacca aagrctatag tgtaattcgt 2520 

agcttagagg ttctaaataa ggaagcaata gtcgaccccg gggttcacat aacattagtt 2580 

gacggagtgc cgggttgtgg aaagaccgcc gagattatag cgagggtcaa ttggaaaact 2 64 0 

gatctagtat tgactcccgg aagggaggca gctgctatga ttaggcggag agcctgcgcc 2700 

ctgcacaagt cacctgtggc aaccaatgac aacgtcagaa ctttcgattc ttttgtgatg 2760 
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aataggaaaa 
ggattactta 
gctaagcaaa 
actttaatag 
actagttttc 
tctgtgaagg 
aaaggaaaga 
aatgatgtga 
cgtgccaccc 
ttaactaggc 
ataatagcgg 
cctaccaaaa 
acggggtttt 
gtactaaatg 
ccttgtaggc 
cagaattttc 
cttgagaatc 
gtagatataa 
gacgaggttt 
gattggtttt 
gatttgcctg 
gatacttcga 
gtaaatgcag 
gatagttcta 
tcagatctct 
aagtcgcaat 
gacgatattt 
caagccggga 
ataggtaata 
tttaaagcta 
cctgatatac 
aagtatggtt 
cctgaccctt 
gttgaggagt 
ttccatttac 
gtaattaatt 
gatgtctcta 



tcttcaagtt 
attttgcgtt 
tcccgtttat 
tcgataatgt 
ttaatactat 
caattaaagt 
taataacgtt 
atactgtgca 
cgactccaat 
acactaaggc 
atgtggaaaa 
tgcagaattc 
atacagacat 
atttcgatgc 
taaccttgag 
tgatccccgt 
ttgtagctat 
ctaacatgtc 
tgcttgatca 
cgtgtcaacc 
cctttgatac 
ttcagtctga 
tttttggtcc 
agtttttctt 
cttcccattc 
ccgatttcca 
tggcttggat 
taaagacgct 
cctttattat 
gtttttgtgg 
aggctactgc 
acttctgcgg 
taaaattaat 
ttcgtatatc 
tcgacgatgc 
gtttgtgtaa 
ag 



tgacgctgtc 
aaagatctca 
aaacagagtc 
agagcgtagg 
ctataaagcc 
gtcaggggcc 
tactcaatct 
tgaaattcag 
aggtttgatt 
aatggtgtat 
ggtcgatcag 
gctgtatgtc 
gcaggagttc 
cgtaaccatg 
taatttagat 
tttgcgtacg 
gataaagagg 
gatttctata 
cttagattgt 

ttatatgcat 
atacccggcc 
ggttttcaag 
ttacactagg 
tgattatgag 
cttctctat u 

catttactat 
cgcagcgtgt 
tgatgattcg 
caaccttgt t 
gaagtatata 
tagtaaatta 
tctcctcgac 
aatccacgaa 
gtatttgagt 



tatgttgacg 
ggttgtaaaa 
atgaatttcg 
tatgtcaccc 
gctgtcgcta 
ggtattctga 
gataagcagt 
ggagaaacct 
gcccgtgatt 
tatactgttg 
tcg3tcttga 
catcgtaara 
tacgatagat 
cggttgaggg 
ccggtacccg 
gcctgtgaaa 
aatatgaata 
gtagataact 
gttagggcta 
gr tggccagt 
atgattaaga 
ttgcaaacta 
tatttaacca 
aaaaaaccag 
attcttgagc 
gagatggcaa 
ggtcacaaaa 
caacggaagt 
gtggctagta 
ctgatctacc 
tgcaattttg 
attcaccatg 
ggtaataaga 
gttgctcata 
ttatttccta 
gataagcgcc 



agggtctgat 
aagccttcgt 
attatcctaa 
ataggtgtcc 
ctactagtcc 
ggcctgagtt 
ccttgatcaa 
ttgaggagac 
caccacatgt 
tattcgatgc 
ccatgtttgc 
ttttcctccc 
gccttcctgg 
acaacgaatt 
ctttgattaa 
ggccgcgcat 
ctcctgattt 
tcttttcttc 
gttccattca 
tagctaattt 
ggcaacccaa 
ttgtttatca 
ccaagtttct 
aagatctgca 
ttgatgtttc 
tttgggaaaa 
gaactatact 
ctggtgatgt 
tgttgccgtt 
ttcctaaggg 
aggcgaaact 
ccaacggctg 
gtcttgtagg 
gtttgtttaa 
atgctggggg 
ttttccgtag 



ggtccatacg 
ctttggtgat 
ggagttaaga 
tagagatgtc 
ggttgtacat 
gacaaagatc 
gagtgggtac 
ggcagttgtg 
actagtggcc 
agttacaagt 
taccactgtg 
tgttagtaaa 
gaattccttc 
taacttacaa 
gaatgaagcg 
tccgggtctt 
agctgggacc 
ttttgttagg 
aagtttttct 
caatttcata 
gagtcggtta 
ccctaaagtg 
tagtatggta 
ggaatttttc 
taaatatgac 
attagggctt 
gcaagatttc 
aactactttt 
agataagtgt 
tttggagtat 
tttccgaaag 
tattgtttac 
gtatgagcat 
tggtgcttat 
ttgcagtttt 
tctttacata 



2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4932 



<210> 22 
<211> 1644 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: amino acid 

sequence corresponding to 186 kD protein of CGMMV 
strain SH 

<400> 22 

Met Ala Asn lie Asn Glu Gin lie Asn Asn Gin Arg Asp Ala Ala Ala 
15 10 15 

Ser Gly Arg Asn Asn Leu Val Ser Gin Leu Ala Ser Lys Arg Val Tyr 
20 25 30 

Asp Glu Ala Val Arg Ser Leu Asp His Gin Asp Arg Arg Pro Lys Met 

35 40 45 



Asn Phe Ser Arg Val Val Ser Thr Glu His Thr Arg Leu Val Thr Asp 
50 55 60 
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Ala Tyr Pro Glu 

65 

His Ser Leu Ala 



Met Gin Val Pro 
100 

Thr Gin His Leu 
115 

Cys Leu Asp Leu 
130 

He Thr Gin His 
145 

Leu Pro Thr Phe 



Cys Ala Val Thr 

180 

Gly Ser Gly Arg 
195 

He Pro Tyr Ser 
210 

Val Cys Tyr Ala 
225 

Pro Val Gly Asn 



Asp Asp Val His 
250 

His Ser Leu Glu 
275 

Ala Asp Asp Arg 

290 

Asp Thr Phe Phe 
305 

Lys Ser Val Gly 



Asn Thr Pro Pro 
340 

Pro Glu Ala Lys 
355 

Leu Ser Gly Asn 
370 



Phe Ser He Ser 

70 

Gly Gly Leu Arg 
85 

Tyr Gly Ser Pro 



Phe Lys Gly Arg 
120 

Lys Asp Val Ala 
135 

Val Gin Arg His 
150 

Gin He Asp Ala 
165 

Cys Ser Asp Val 



Asp Asn His Ala 
200 

Ser He Gly Pro 
215 

Ala Phe His Phe 
230 

Leu Asn Ser He 
245 

Phe Leu Phe Ser 



Asn He Lys Leu 
280 

Phe Val Tyr He 

295 

Phe Arg Leu Val 
310 

His Tyr Ser Lys 
325 

He Phe Gin Asp 



Lys Val Leu He 
360 

Val Lys He Ser 
375 



30 



Phe Thr Ala Thr 

75 

Leu Leu Glu Leu 
90 

Cys Tyr Asp He 
105 

Ser Tyr Val His 



Arg Asn Val Met 
140 

Lys Gly Ser Cys 
155 

Phe Arg Arg Tyr 
170 

Phe Gin Glu Cys 

185 

Val Ser Leu His 



Ala Leu His Arg 
220 

Ser Glu Ala Leu 
235 

Gly Ala Gin Phe 
250 

Glu Glu Ser Thr 
265 

He Val Met Arg 



Lys Glu Phe Met 

300 

Arg Ala Asp Thr 
315 

Ser Lys Ser Glu 
330 

Lys Ala Thr Phe 
345 

Pro Lys Phe Glu 



Arg Met Leu Val 
380 



Lys Asn Ser Val 

80 

Glu Tyr Met Met 
95 

Gly Gly Asn Tyr 
110 

Cys Cys Asn Pro 
125 

Tyr Asn Asp Met 



Gly Cys Arg Pro 
160 

Asp Ser Ser Pro 
175 

Ser Tyr Asp Phe 
190 

Ser He Tyr Asp 
205 

Lys Asn Val Arg 



Leu Leu Gly Ser 
240 

Arg Val Asp Gly 
255 

Leu His Tyr Thr 
270 

Thr Tyr Phe Pro 
285 

val Lys Arg Val 



Kis Met Leu His 
320 

Tyr Phe Ala Leu 
335 

Ser Val Trp Phe 
350 

Leu Ser Arg Phe 
365 

A.sp Ala Asp Phe 
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Val His Thr He 
385 

Val Trp Lys Asn 



He Val Asn Gly 

420 

Gin Leu Thr Asp 

435 

Lys Val Gin He 
450 

Gly Leu Leu Arg 
465 

Leu Gly Asp Cys 



Thr Ser Ser Asp 
500 

Phe Ser Asp Tyr 
515 

Arg Glu Ser Val 
530 

Lys He Asp Glu 
545 

Glu Lys Phe Gin 



He Gly His Val 
580 

Val Thr Gly Leu 
595 

Leu Ser Ser Thr 
610 

Asp Met Glu Asp 
625 

Ser Pro Glu Met 



Gly Ala Leu Val 
660 

Thr Trp Ala Glu 
675 

Val Arg Lys Pro 
690 



He Asn His He 
390 

Val Gin Ser Phe 
405 

Val Ser Val Lys 



He Ser Phe Ser 
440 

Glu Leu Met Ser 
455 

Arg Phe Ala Asp 
470 

Val Tyr Asp Ala 
485 

Glu Leu Lys Val 



Leu Glu Gly Met 
520 

Ser Glu Leu Leu 
535 

He Arg Asn Asn 
550 

Glu Phe Cys Lys 
565 

He Glu Ala He 



Gly Thr Leu Ser 
600 

Ser Val Asp Thr 
615 

He Val Leu Met 
630 

Ala Arg Trp Ala 
645 

Glu Tyr Lys Val 



Lys Val Lys Ala 
680 

Gin Phe Ser Lys 
695 



Ser Thr Tyr Asp 
395 

Val Glu Ser He 
410 

Ser Glu Trp Asn 
425 

He Phe Pro Leu 



Asp Lys Val Val 
460 

Ser Leu Lys Ser 
475 

Leu Val Gin Thr 
490 

Leu Leu Pro Glu 
505 

Tyr Glu Ala Asp 



Ala Ser Gly Asp 
540 

Tyr Ser Gly Val 
555 

Glu Leu Asn Val 
570 

Phe Ser Gin Lys 
585 

Pro Glu Met Gly 



Cys Glu Asp Met 
620 

Ala Asp Lys Ser 
635 

Asp Val Lys Tyr 

650 

Gly Thr Ser Met 
665 

Val Leu Pro Leu 



Pro Leu Asp Glu 
700 



Asn Lys Ala Leu 
400 

Arg Ser Arg Val 
415 

Val Pro Val Asp 
430 

Val Lys Val Arg 

445 

He Glu Ala Arg 



Ala Val Glu Gly 
480 

Gly Trp Phe Asp 
495 

Pro Phe Met Thr 
510 

Ala Lys He Glu 
525 

Asp Leu Phe Lys 



Glu Phe Asp Val 
560 

Asn Pro Met Leu 
575 

Ala Gly Val Thr 
590 

Ala Ser Val Ala 
605 

Asp Val Thr Glu 



His Ser Tyr Met 
640 

Gly Asn Asn Lys 

655 

Thr Leu Pro Ala 
670 

Ser Gly He Cys 
685 

Glu Asp Asp Leu 



Arg Leu Ser Asn Met Asn Phe Phe Lys Val Ser Asp Leu Lys Leu Lys 
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705 

Lys Thr 



710 



715 



720 



He Thr Pro Val Val Tyr Thr Gly Thr He Arg Glu Arg Gin 
725 730 735 



Met Lys Asn Tyr He Asp Tyr Leu Ser Ala Ser Leu Gly Ser Thr Leu 
740 745 750 

Gly Asn Leu Glu Arg He Val Arg Ser Asp Trp Asn Gly Thr Glu Glu 
755 760 765 

Ser Met Gin Thr Phe Gly Leu Tyr Asp Cys Glu Lys Cys Lys Trp Leu 
770 775 780 

Leu Leu Pro Ala Glu Lys Lys His Ala Trp Ala Val Val Leu Ala Ser 
785 790 795 800 

Asp Asp Thr Thr Arg He He Phe Leu Ser Tyr Asp Glu Ser Gly Ser 

805 810 ' 815 

Pro He He Asp Lys Lys Asn Trp Lys Arg Phe Ala Val Cys Ser Glu 
820 825 830 

Thr Lys Val Tyr Ser Val He Arg Ser Leu Glu Val Leu Asn Lys Glu 
835 840 845 

Ala He Val Asp Pro Gly Val His He Thr Leu Val Asp Gly Val Pro 
850 855 860 

Gly Cys Gly Lys Thr Ala Glu He He Ala Arg Val Asn Trp Lys Thr 
865 870 875 880 

Asp Leu Val Leu Thr Pro Gly Arg Glu Ala Ala Ala Met He Arg Arg 
885 890 895 

Arg Ala Cys Ala Leu His Lys Ser Pro Val Ala Thr Asn Asp Asn Val 
900 905 910 

Arg Thr Phe Asp Ser Phe Val Met Asn Arg Lys He Phe Lys Phe Asp 

915 920 925 

Ala Val Tyr Val Asp Glu Gly Leu Met Val His Thr Gly Leu Leu Asn 
930 935 940 

Phe Ala Leu Lys He Ser Gly Cys Lys Lys Ala Phe Val Phe Gly Asp 
945 950 955 960 

Ala Lys Gin He Pro Phe He Asn Arg Val Met Asn Phe Asp Tyr Pro 
965 970 975 

Lys Glu Leu Arg Thr Leu He Val Asp Asn Val Glu Arg Arg Tyr Val 
980 985 990 

Thr His Arg Cys Pro Arg Asp Val Thr Ser Phe Leu Asn Thr He Tyr 
995 1000 1005 

Lys Ala Ala Val Ala Thr Thr Ser Pro Val Val His Ser Val Lys Ala 
1010 1015 1020 



He Lys Val Ser Gly Ala Gly He Leu Arg Pro Glu Leu Thr Lys He 
1025 1030 1035 1040 
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Lys Gly Lys He He Thr Phe Thr Gin Ser Asp Lys Gin Ser Leu He 
1045 1050 * 1055 

Lys Ser Gly Tyr Asn Asp Val Asn Thr Val His Glu He Gin Gly Glu 
1060 1065 1070 

Thr Phe Glu Glu Thr Ala Val Val Arg Ala Thr Pro Thr Pro He Gly 
1075 1080 1085 

Leu He Ala Arg Asp Ser Pro His Val Leu Val Ala Leu Thr Arg His 
1090 1095 1100 

Thr Lys Ala Met Val Tyr Tyr Thr Val Val Phe Asp Ala Val Thr Ser 
1105 1110 1115 1120 

He He Ala Asp Val Glu Lys Val Asp Gin Ser He Leu Thr Met Phe 
1125 1130 1135 

Ala Thr Thr Val Pro Thr Lys Met Gin Asn Ser Leu Tyr Val His Arg 
1140 1145 1150 

Asn He Phe Leu Pro Val Ser Lys Thr Gly Phe Tyr Thr Asp Met Gin 
1155 1160 1165 

Glu Phe Tyr Asp Arg Cys Leu Pro Gly Asn Ser Phe Val Leu Asn Asp 
1170 1175 1180 

Phe Asp Ala Val Thr Met Arg Leu Arg Asp Asn Glu Phe Asn Leu Gin 
1185 1190 1195 1200 

Pro Cys Arg Leu Thr Leu Ser Asn Leu Asp Pro Val Pro Ala Leu He 
1205 1210 1215 

Lys Asn Glu Ala Gin Asn Phe Leu He Pro Val Leu Arg Thr Ala Cys 
1220 1225 1230 

Glu Arg Pro Arg He Pro Gly Leu Leu Glu Asn Leu Val Ala Met He 
1235 1240 1245 

Lys Arg Asn Met Asn Thr Pro Asp Leu Ala Gly Thr Val Asp He Thr 
1250 1255 1260 

Asn Met Ser He Ser He Val Asp Asn Phe Phe Ser Ser Phe Val Arg 

1265 1270 1275 1280 

Asp Glu Val Leu Leu Asp His Leu Asp Cys Val Arg Ala Ser Ser He 
1285 1290 1295 

Gin Ser Phe Ser Asp Trp Phe Ser Cys Gin Pro Thr Ser Ala Val Gly 
1300 1305 1310 

Gin Leu Ala Asn Phe Asn Phe He Asp Leu Pro Ala Phe Asp Thr Tyr 
1315 1320 1325 

Met His Met He Lys Arg Gin Pro Lys Ser Arg Leu Asp Thr Ser He 
1330 1335 1340 



Gin Ser Glu Tyr Pro Ala Leu Gin Thr He Val Tyr His Pro Lys Val 
1345 1350 1355 1360 
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Val Asn Ala Val Phe Gly Pro Val Phe Lys Tyr Leu Thr Thr Lys Phe 
1365 1370 1375 

Leu Ser Met Val Asp Ser Ser Lys Phe Phe Phe Tyr Thr Arg Lys Lys 
1380 1385 1390 

Pro Glu Asp Leu Gin Glu Phe Phe Ser Asp Leu Ser Ser His Ser Asp 
1395 1400 1405 

Tyr Glu lie Leu Glu Leu Asp Val Ser Lys Tyr Asp Lys Ser Gin Ser 
1410 1415 1420 

Asp Phe His Phe Ser lie Glu Met Ala lie Trp Glu Lys Leu Gly Leu 
1425 1430 1435 1440 

Asp Asp He Leu Ala Trp Met Trp Ser Met Gly His Lys Arg Thr He 
1445 1450 1455 

Leu Gin Asp Phe Gin Ala Gly He Lys Thr Leu He Tyr Tyr Gin. Arg 
1460 1465 1470 

Lys Ser Gly Asp Val Thr Thr Phe He Gly Asn Thr Phe He He Ala 
1475 1480 1485 

Ala Cys Val Ala Ser Met Leu Pro Leu Asp Lys Cys Phe Lys Ala Ser 
1490 1495 1500 

Phe Cys Gly Asp Asp Ser Leu He Tyr Leu Pro Lys Gly Leu Glu Tyr 
1505 1510 1515 1520 

Pro Asp He Gin Ala Thr Ala Asn Leu Val Trp Asn Phe Glu Ala Lys 
1525 1530 1535 

Leu Phe Arg Lys Lys Tyr Gly Tyr Phe Cys Gly Lys Tyr He He His 
1540 1545 1550 

His Ala Asn Gly Cys He Val Tyr Pro Asp Pro Leu Lys Leu He Ser 
1555 1560 1565 

Lys Leu Gly Asn Lys Ser Leu Val Gly Tyr Glu His Val Glu Glu Phe 
1570 1575 1580 

Arg He Ser Leu Leu Asp Val Ala His Ser Leu Phe Asn Gly Ala Tyr 
1585 1590 1595 1600 

Phe His Leu Leu Asp Asp Ala He His Glu Leu Phe Pro Asn Ala Gly 
1605 1610 1615 

Gly Cys Ser Phe Val He Asn Cys Leu Cys Lys Tyr Leu Ser Asp Lys 
1620 1625 1630 



Arg Leu Phe Arg Ser Leu Tyr He Asp Val Ser Lys 
1635 1640 



<210> 23 

<211> 1139 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: nucleotide 

sequence corresponding to sequence encoding coat 
protein of CGMMV strain SH 



<400> 23 

aattcggctt ctgtaggggt ggtgctactg ttgctctggt tgacacaagg atgcattctg 60 

ttgcagaggg aactatatgc aaattttcag ctcccgccac cgtccgcgaa ttctctgtta 120 

ggttcatacc taattatcct gtcgtggctg cggatgccct tcgcgatcct tggtctttat 180 

ttgtgagact ctctaatgtg ggcattaaag atggtttcca tcctttgact ttagaggtcg 240 

cttgtttagt cgctacaact aactctatta tcaaaaaggg tcttagagct tctgtagtcg 300 

agtctgtcgt ctcttccgat cagtctattg tcctagattc cttgtccgag aaagttgaac 360 

ctttctttga caaagttcct atttcagcgg ctgtaatggc aagagatccc agttataggt 420 

ctaggtcaca gtctgtcggt ggtcgtggta agcggcattc taaacctcca aatcggaggt 480 

tggactctgc ttctgaagag tccagttctg tttcttttga agatggctta caatccgatc 540 

acacctagca aacttattgc gtttagtgct tcatatgttc ccgtcaggac tttacttaat 600 

tttctagttg cttcacaagg taccgctttt cagactcaag cgggaagaga ttctztccgc 660 

gagtccctgt ctgcgttacc ctcgtctgtc gtagatatta attctagatt cccagatgcg 720 

ggtttttacg ctttcctcaa cggtcctgtg ttgaggccta Ticttcgtttc gcttctcagc 780 

tccacggata cgcgtaatag ggtcartgag gttgtagatc ctagcaatcc tacgactgct 840 

gagtcgctta acgctgtaaa gcgtactgat gacgcgtcta cagccgctag ggccgagata 900 

gataatttaa tagagtctat ttctaagggt tttgatgttt acgatagggc ttcatttgaa 960 

gccgcgtttt cggtagtctg gtcagaggct accacctcga aagcttagtt tcgagggtct 1020 

tctgatggtg gtgcacacca aagtgcatag tgctttcccg ttcacttaaa tcgaacggtt 1080 

tgctcattgg tttgcggaaa cctctcacgt gtgacgttga agtttctatg ggcaagccg 1139 



<210> 24 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 97G01 
<400> 24 

aggtgtcagt ggagaactca ttga 24 



<210> 25 
<211> 17 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 97G02 
<400> 25 

ggcgttgtgg tttgtgg 17 



<210> 26 
<211> 21 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 97G03 



<400> 26 

ctgtaggggt ggtgctactg t 



21 
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36 



<210> 27 
<211> 21 
5 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 97G18 

10 

<400> 27 . 

gcccatagaa acttcaacgt c 21 

15 <210> 28 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
20 <220> 

<223> Description of Artificial Sequence: primer 98A88 
<400> 28 

ccatggagaa ttcgctgtat gtcc 24 



<210> 29 
<211> 23 
<212> DNA 
30 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 98A86 
35 <400> 29 

cgagctctcg actgacacct tac 23 



<210> 30 
40 <211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

45 <223> Description of Artificial Sequence: primer 98AB4 



25 



<400> 30 

ccatggcaaa cattaatgaa c 



21 



50 



<210> 31 
<211> 21 
<212> DNA 



<213> Artificial Sequence 



55 



<220> 

<223> Description of Artificial Sequence: primer98A85 



60 



<400> 31 

caaccatggc aaacattaat g 



21 
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<210> 32 
<211> 22 
<212> DNA 
5 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 98G63 

10 <400> 32 

taacagggag gaaaatatta eg 22 



<210> 33 
15 <211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

20 <223> Description of Artificial Sequence: primer 98L99 
<400> 33 

gagctcggat ccactagtaa cggc 24 

25 

<210> 34 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 98L107 
<400> 34 

35 tagagctctt gaagctaagc aaattccg 28 

<210> 35 
<211> 31 
40 <212> DNA 

<213> Artificial Sequence 



30 



45 



<220> 

<223> Description of Artificial Sequence: primer 98L108 
<400> 35 

ttcaagagct ctaatcaccg aagacaaagg c 31 



50 <210> 36 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
55 <220> 

<223> Description of Artificial Sequence: primer 98L102 

<400> 36 

gaattatatc gattatctat cggc 24 

60 
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38 



<210> 37 

<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 98L103 
<400> 37 

gataatcgat ataattcttc atctgcc 27 

<210> 38 
<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 98L104 



<210> 39 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 98L105 

<400> 39 

aattactagt ttccggaagc aagcagctca g 31 

<210> 40 
<211> 22 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 98L106 



<210> 41 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic 
sequence 97G40 



<400> 38 

aactagtaat tgatgatctg ttcaagaag 



29 



<400> 40 

gccctctaga tgcatgctcg ag 



22 



<400> 41 

ctagagtttt aatttttata attaaacaaa 



30 
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<210> 42 
<211> 36 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic 
sequence 97G41 

<400> 42 

tcaaaattaa aaatattaat ttgtttgttg ttgttg 36 

<210> 43 

<211> 36 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: synthetic 
sequence 97G42 

<400> 43 

caacaacaac aacaacaaac aattttaaaa caacac 36 

<210> 44 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic 
sequence 97G4 3 



<400> 44 

ttgttgtttg ttaaaatttt gttgtggtac 



30 



